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PREFACE 


Preface 

The aim of this book is to present fundamental concepts in quantum mechanics and a general mathematical 
formalism beyond the wavefunction framework taught in introductory quantum mechanics courses. This includes 
topics such as Dirac formalism with bra- and ket-vectors in Hilbert space, Heisenberg formalism with matrices, 
approximation methods in quantum mechanics, scattering theory, atoms and electrons in magnetic fields, coherent 
states, field quantization and radiation theory, and the density matrix formalism. In addition to explaining the 
underlying theory in a detailed manner, we shall also provide a number of examples that will illustrate the 
formalisms "in action". 

This book is primarily based on my lecture notes from teaching this class to undergraduate students, and the notes 
in turn are based on the book "Kvantemekanikk" by P. C. Hemmer. I have also included additional topics and 
instructive examples which hopefully will allow the reader to obtain a more thorough physical understanding of 
the material. This book is suitable as material for a full-semester course in intermediate quantum mechanics at the 
undergraduate level. 

It is my goal that students who study this book afterwards will find themselves well prepared to dig deeper into 
the remarkable world of theoretical physics at a more advanced level. I welcome feedback on the book (including 
any typos that you may find, although I have endeavored to eliminate as many of them as possible) and hope that 
you will have an exciting time reading it! 


Jacob Linder (jacob.linder@ntnu.no) 

Norwegian University of Science and Technology 
Trondheim, Norway 
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I. GENERAL FORMULATION OF QUANTUM MECHANICS 

Learning goals. After reading this chapter, the student should: 

• Be able to understand and use Dirac’s bra-ket notation. 

• Know the fundamental axioms of the general formulation of QM. 

• Know how the general formulation and the wavemechanics formulation of QM are related. 


Introductory quantum mechanics (QM) utilizes a position-representation where one works with wavefunctions 
\p = ip(r). However, this is in fact just a special case of a more general theory. The general theory is important 
because some QM systems cannot be treated by wavefunctions in position space, such as the spin degree of 
freedom. We therefore develop the foundation for the general theory in what follows. 


A. Dirac’s bra-ket notation 

We introduce a new formulation where a QM state is described by a state vector |^) in a complex linear vector 
space namely the so-called Hilbert space. The Hilbert space may have a finite or infinite dimension, and 
in often cases the latter. For instance, we need infinite Hilbert spaces to represent a vector describing continuous 
variables (such as position). In contrast, only a two-dimensional Hilbert space is required to describe a single 
spin-1/2 state. We will show this explicitly later on. For now, you may simply think of as the space where 
the state vector | 'ip) resides. Mathematically, is required in order to perform operations such as inner products 
between state vectors in a well-defined manner. 

There are different notations which are used for the state vector. A common convention is to denote the state with 
its quantum numbers. For instance, stationary states in a Coulomb-field would then be written as | nlm), where 
{n, /, m} are the quantum numbers characterizing the eigenstates of the system (as treated in introductory courses 
to quantum mechanics). Generally, the state vector may also depend on time. In what follows, we usually suppress 
the /-dependence notation-wise unless it is of importance. 
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GENERAL FORMULATION OF QUANTUM MECHANICS 


For any state | a) in there is assigned a dual vector (a | in a dual vector space. The relation between the two 
state vectors is that the scalar product 

<a| • \b) = (a\b) (1.1) 

is defined as a complex number with the property 

(a\b) = ( b\a )*. (1.2) 

The two states | a) and | b) are orthogonal if their inner product (a\b) = 0. The notation used here is due to Dirac 
and known as bra-ket notation: 


(... | = bra, |...) = ket. (1.3) 

If a vector is multiplied with a number c, the corresponding dual vector must be multiplied by c*. To see this, let 
| a') = c\a). It follows that (b\a f ) = c(b\a) and thus 

(a'\b) = (b\a')* = c*(6|a)* = c*(a\b). (1.4) 


It is then clear that (a'\ = c*(a\. 

In a n-dimensional vector space, we may choose n linearly independent vectors |1), |2),... | n) as basis vectors 
and expand an arbitrary state vector |-0) in these: 


n 

W«5>l*>, (!-5) 

k = 1 

where Ck are complex numbers. Assume for simplicity that these basis vectors are orthonormal, so that (k\m) = 
4m- We allow the dimension n to not necessarily be finite. It follows that c m = (m^), so that we may write 

I^> = HW)I & )- u- 6 ) 

k 

In turn, this can be written as 

m = 5>x*i- w (i.7) 

k 

(we simply interchanged the position of (fc|^) and | k) which is fine since (k\^) is a scalar) which means that we 
must have 


J2\k)(k\ = l- ( 1 . 8 ) 

k 

This is the so-called completeness relation which will turn out to be very useful. The corresponding relation for 
usual vectors in three dimensional Cartesian space can in fact be written in a similar fashion: 

^ = = 1? (1*9) 

k 

because using this operator on a vector A is equivalent to the identity operation: 

^■x^x H - &yAy + C Z A Z = A. (1.10) 

While (a \ b) is the inner product and equal to a complex number in general, the outer product of the vectors | a) 
and | b) is \a)(b\ and is generally equal to an operator. For instance, \k)(k\ is a projection operator that projects a 
state vector onto the |fc)-axis. 

Some basis vector sets {| k)} are such that k takes on continuous values. Then, we replace the summation with an 
integration and also a delta function normalization: 

(k\k f ) =5{k-k'). (1.11) 
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The expansion of a state vector \ijj) using such basis vectors then takes the form 

|^) = J dk c(k)\k). 

Multiplying from the left with (k'\, we find the expansion coefficients 

(k 1 \ip) = J dk c(k)(k’\k) = J dk c(k)S(k' - k) = c{k'). 


We may thus write that 


\i>) = j dk{k\i,)\k). 

The completeness relation for continuous variables then takes the form 

J dk\k)(k\ = 1. 

The norm 11/| | of a vector |/) is defined as 

ll/ll = VU\f)>o. 


( 1 . 12 ) 

(1.13) 

(1.14) 

(1.15) 

(1.16) 


We can see that ||/|| is always real and non-negative by using the completeness relation we derived. It follows 
from the property: 


(/i/) = E^i fc )W) = Eiw)i 2 (i-i7) 

k k 

since (f\k) = ( k\f)*. 


B. Operators and eigenvectors 

An operator in Hilbert space is an image of on itself. This means that the operator A assigns a vector |c) to 
any vector | a) according to: 

A\a) = | c). (1.18) 

The adjoint operator .4 '’ is defined by 

(a\A'\b) = (b\A\a)* (1.19) 

which must hold for any two vectors |o) and |6) in Jf'. By setting A\a) = |c), we may write Eq. (1.19) as 

(a\A^\b) = (b\c)* = (c\b). (1.20) 

It then follows that (c| = (a\A^. We have thus shown that the dual vector of A\a) is (a\A^. The following 
properties of the adjoint operation follow from our definitions so far (try to prove them yourself!) 

• (At)t = A 

• (aA)^ = oAA^ where a is a constant 

• (AB) t =B^Al 

An operator is self-adjoint (also known as Hermitian) if 

A f = A. (1.21) 

It follows that for such operators 

(a|A|a) = (a|A|a)* (a|A|a) G (1.22) 

We define an eigenvector of A to be |a) where 

A\a) = \ a \a). (1.23) 

The number A a is the eigenvalue. The collection of eigenvalues for the operator A are known as the spectrum of 
A. An important observation is that: 
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Eigenvalues of Hermitian operators A are real. 


This follows since A a = ^ as shown above. Representing a physical observable with a Hermitian 

operator thus guarantees that the corresponding eigenvalues are real, as they should be for a measurable quantity. 
The set of eigenvectors (|a)} for an operator corresponding to a physical quantity is assumed to be a complete set. 
This means that such eigenvectors may be used as basis vectors. 


C. The axioms of the general formulation of QM 

The general formulation of QM, which we have now established the notation for, is based on the following 
postulates. 

A: To any observable quantity F, one assigns a linear Hermitian operator F in Hilbert space. The operators of 
a generalized coordinate q n and the corresponding generalized momentum p n satsify the commutation relation 

[Qn 5 Pn\ — 

B: The state of a physical system is described by a state vector \^(t)) in a Hilbert space. It has the property 
= 1 and satisfies the time-dependent Schrodinger-equation 

m\il’(t))=H\iKt)). (1.24) 


Here, H is the Hamilton operator. 

C: The expectation value of an observable quantity F in the state |-0) is (F) = (^IFI^). 

D: The measurement of an observable quantity F yields as a result one of the eigenvalues f n of the operator F. 
An observable quantity is defined as a property of the system’s state which may be determined by performing 
physical operations on the system (such as subjecting a charged particle to a magnetic field and reading off its 
position). 
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GENERAL FORMULATION OF QUANTUM MECHANICS 


With these postulates, we can now describe QM with different sets of basis vectors. To begin with, we will 
look at this in more detail using the position representation and establish how this is related to the wavefunction 
formalism in introductory courses of quantum mechanics. 


D. Different representations 

We start with the position representation and consider motion only in ID, in order to keep the notation simple. The 
eigenvectors of the position operator x are denoted \x') where x' is the eigenvalue: 

x\x r ) = x'\x'). (1.25) 

We then have x' £ (—oo, oo). The total state vector may be expanded as 

M = /*V K»)M- “■“> 

The complex number (x'\ ip) is the contribution to the state vector |^) from position x'. Hence, this is in fact 
nothing but the familiar wavefunction in position space: 

W) = (x'W). (1.27) 

We see that 'ip(x) = (x\ip) are the components of | ip) with the basis vectors \x ). If x instead took discrete values, 
we could have written 

(i>(x i)\ 

\i>) = I i’ fa) | • (i-28> 

Since x is a continuous variable, we must ^-function normalize the basis-vectors: 


<*v> 

1 

II 

(1.29) 

The scalar product between \ipi) and |^ 2 ) may be written as 


(V’llifo) = 

1 dx(ipi\x){x\ip2) 

(1.30) 


j dx(x\tp 1 )* (x\lp2) 

(1-31) 


j dxipl(x)ip 2 (x), 

(1.32) 


where we made use of the completeness relation f dx\x) (x\ = 1. Let us also consider how to work with operators 
in this representation. The expectation value of F may be written as: 

The first and last factors inside the integral are wavefunctions, as we showed previously, so it remains to clarify 
what the matrix elements (x"\F\x') are. If F = x , it is simple. We then have: 

(x"\x\x f ) = x'{x"\x') = x'6(x” - x'). (1.34) 

More generally, if F is a function of x [F = F(x)], then 

(x // |F(x)|x / ) = F(x')S(x" - x'). (1.35) 

This follows for any power of x since x n \x') = ( x') n \x' ), and thus the same is true for any function F(x) that 
may be expanded in powers of x. 

What about the case F = p x = pi We know that [x,p\ = ih , and thus 

(x"\xp — px\x') = ih(x"\x') = i hS(x" — x'). (1.36) 
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The second term on the l.h.s. is 


{x"\ -px\x') = -x'(x"\p\x'). 

The first term on the l.h.s. may be computed as follows: 

{x "^ W) =h i{z " 

= x"{x"\p\x'). 


(1.37) 


(1.38) 


An alternative way to show this explicitly for the second term is as follows. First, note that if A\a) — A|a), then 
(a\A^ — (a|A*. We then see that 


(x"\px\x') = (x'\(pxy\x")* = (x'\tfpt\x")\ (1.39) 

But x and p must be Hermitian so that x^ — x and similarly for p. By using this, we obtain that 

(x'\x^p^\x")* — x'(x'\p\x")* 

= x'(x"\p\x'), (1-40) 

which is consistent with Eq. (1.37). Combining the results we obtained so far, we then have that 

(x" — x')(x"\p\x') = ihS(x" — x'). (1.41) 

Now, a fundamental property of the ^-function is that x d5 ^ — —S(x). We now use this by letting x s x" — x' 
and hold x' to be constant. It follows that 

[x" — x f )-^-^5{x" — x') — —S(x" — x'). (1-42) 

We can then rewrite Eq. (1.41) to 

{x"\p\x’) = - X 1 ). (1.43) 

1 ox" 

This can be further generalized to a power p n : 

( x ”\P n \ x ') = 5(x"-x'). (1.44) 

Since we have now proven that for an arbitrary function F(p), we have 

{x"\F(p)\x') = - x'), (1.45) 

it follows that in the most general case where the operator depends on both p and x, we have: 

(■ x"\F(p,x)\x ') = f(y ^jj,x"^S{x" -x'), (1.46) 

Since we now know this expectation value, we can finally go back and evaluate the expression we started out with: 

= J dx" j dx'ip*(x")F(^-^-^, x"^5{x" — x')ip(x l ) 

= J (!- 47 ) 

In the end, we see that this is precisely how we are used to evaluate expectation values in the wavefunction 
formulation. Hence, there is consistency between the general formulation of QM and the position representation. 
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We may also show that the two versions of the Schrodinger equation (SE) are consistent. The general formula is: 

\hdt\ip) = H\ip) (1-48) 

and can be brought to the position representation by multiplying with (x| from the left side, so that one obtains 

(149 > 

We previously established that (x\i/j) — ^(x). For a Hamiltonian operator H — it follows that 

i hdti/j(x) = j dx'H^ x"jS(x — x f )xjj(x f ) = Hx S ji/j(x). (1.50) 

Summarizing, we see that when the eigenvectors for the position operator are used as basis vectors, the general 
formulation of QM is reduced to wavemechanics in position space. In the same manner as above, the wave- 
mechanics in momentum space is contained in the general formulation of QM. In this case, we want to use the 
eigenvectors | p) of the momentum operator p as basis vectors. The wavefunction in the momentum representation 
is then (j>(jp) — (p\i/)). 

Interestingly, the wavemechanics formulation in position space was not the first one to be developed. Instead, 
the matrix mechanics formulation of QM was the originally developed representation by Heisenberg in 1925, 
six months before Schrodinger developed the wavemechanics. In the matrix mechanics case, the state vector is 
projected down on an arbitrary, discrete, orthonormal set of basis vectors | k), k — {1,2,...}. A vector \a) may 
then be expanded as \a) — ak\k) where = (k\a). These coefficients can be visualized as components of a 
vector: 


a x 


a — 


a 2 


(1.51) 
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The scalar product (b\a) is then 


We see that 


(%) = E^ k ^ k ^ 

k 


^2(k\b)* (k\a) = ^b* k a k . 
k k 


Ca = [b\, 


U 2 , ••• 


= J2 b k a k- 


(1.52) 


(1.53) 


With this representation, an operator A has an expectation value which is a matrix with elements A mn m (m\A\n). 
If | b) = A|a), we then obtain that 

(m\b) - (m\A\a) = ^^(m\A\n}(n\a), m = 1 , 2,... (1-54) 

n 

which in turn can be written as b m — AmnUn- But this is nothing but the very definition of matrix multiplica¬ 
tion: 


bi~ 


'A u 

A12 . . . 


CLl 

&2 

— 

A21 

^-22 • • • 


a 2 


In effect, the result of acting with the operator on the state vector is represented hy conventional matrix mul¬ 
tiplication. This representation is commonly used and its most important application is on the stationary SE 
H \ip) = E\f>). If we know the eigenvalues | n) (although usually we do not: the task is to find them), using them 
as basis vectors gives: 


(yTi\H\ri) — E n (^Tn\Ti) y E rnn — E n S rnn . 

We used that H\n) = E n \n). The matrix-representation of H is then diagonal. Explicitly, we have 


(1.56) 


'E 1 

0 ...' 


a\ 


ai 

0 

e 2 ... 


a 2 

= E 

a 2 


The solution for the eigenstates becomes a n = S mn , E = E n . However, if the eigenvectors are not known , one 
has to use a different basis set for which (m\H\n) is not diagonal to begin with, i.e.: 


H u 

#12 •••' 


Cl 


Cl 

H 21 

#22 ... 


C 2 

= E 

c 2 


The task to solve the SE is then mathematically equivalent to changing the basis, c& = ^ k S n kCLk, so that the 
matrix becomes diagonal. This is a standard method, suitable for numerics, which we later will use for degenerate 
perturbation theory. 


E. Briefly about the Schrodinger- and Heisenberg-picture 

So far, we have described quantum mechanical systems by a state \f>) which "moves" in a Hilbert space where the 
axes (basis vectors) are time-independent. This is known as the Schrodinger picture. However, it is fully possible 
to take the perspective from a rotating coordinate system. The simplest option is in fact that the rotation of the 
system is such that the state vector is at rest. This is known as the Heisenberg picture. 

Let us first recap how time-evolution is treated in the position representation. Since the SE is linear and 1st order 
in time, the propagator U = U(r , t; ro, to) determines the evolution of the wavefunction from t 0 to t: 

ty(r,t) = j U(r,t;r 0 ,t 0 )'&(r 0 ,to)dr 0 . (1.59) 
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For a Hamilton operator that does not depend explicitly on time, we can expand 

VI ->{r,i) = Y J CnMr)z-' lEnt,h (1-60) 

n 

where c n are determined by 

c n = e lEnto/h J y(r 0 ,t 0 )ip*(r 0 )dr 0 . (1.61) 

Inserting this c n into the expression for \h, we obtain an equation for the propagator: 

U(r, t- p 0 , t 0 ) = r n (ro)Mr)e- i{t - to)En/h . (1.62) 

n 

If we instead have a continuous eigenvalue spectrum, the summation is replaced by an integral: 

/ oo 

V’p(^o)V’p(^)e _l(t_to)Bp # (1-63) 

-OO 

where p is the eigenvalue parameter. To be concrete, consider the example of a free one-dimensionally moving 
particle for which 


%{x) = 



E p = p 2 /2m. 


d-64) 
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The propagator then turns into: 

1 f‘°° 

U{x,t-X 0 ,t 0 ) = — / e ip(x-x 0 )/h e -i(t-t 0 )p 2 /^h dp 
27T/Z- J _ OQ 


Am(x—xo) 2 /2h(t-t 0 ) 


27rih(t — to) 

With this in mind, let us now turn to the general formulation of QM. The time evolution is given by 

\y(t)) = U(t,t 0 )\*(t 0 )}. 

If the Hamilton operator does not contain time explicitly, we have 

U(t,t 0 ) = 

where the exponential operator should be interpreted via the formula 


n x 


00 vn 

= Y—. 

^ ml 


(1.65) 

( 1 . 66 ) 

(1.67) 

( 1 . 68 ) 


n=0 


We see that |U'(<)) = e l( * f °^/ a |'P(to)) satisfies the time-dependent SE itl, |*P(7)) = H\'V(t j). The correspond¬ 
ing bra to the above ket is 


(1.69) 

(1.70) 


(*(t) | = <^(t 0 )|e i(t -* o)i ^ /fi . 

Normalization is thus preserved since: 

We may compute the expectation value of some physical quantity F at the time t in the usual way: 

(F) = (tt(t)|F|tf(t)> = (^(t 0 )\F H \^(t 0 )) (1.71) 

where we defined 

p H — e i(t-to)H/hjp e -i(t-t 0 )H/h' (1.72) 

We see that (F) can be expressed in two equivalent ways: 

• Schrodinger picture: expectation value of a time-independent operator F in a time-dependent state. 

• Heisenberg picture: expectation value of a time-dependent operator Fh in a time-independent state. 

We see that Fh = WFU where the evolution operator satisfies W — U~ l , meaning that it is a unitary operator. 
In the Schrodinger picture, we know that 

|(F) = i([H,F]>. (1.731 

In the Heisenberg picture, we may differentiate Fh to obtain the equation: 

= d.741 

Note that the commutator relations are preserved when making a transition to time-dependent operators. If 
[A, B] = C, then 

[A h , B h } = [UAU, U^BU] = U ] (AB - BA)U = U^CU = C H . (1.75) 


Example 1. Heisenberg picture representation of creation and annihilation operators. For the creation and 
annihilation operators a t and a of a harmonic oscillator, we obtain from Eq. (1.74): 


dan 

dt 


. da tt „ + 

= -i ua H , = i ua' H . 


(1.76) 


The solution is straightforward to obtain: 


a-H(t) = e lwt a H ( 0), a\j(t) = e lut a ] H { 0). 


(1.77) 
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It is also worth mentioning that it is possible with an approach where only part of the time dependence is transferred 
to the operators. This is the interaction picture , which is often used when the Hamilton operator can be written 
as H = Hq + Hi, where Hi has to be handled via perturbation theory. We may then transform with U 0 = 
e -i(t-t 0 )H 0 /h so the state vector would be time independent if Hj could be neglected. Which picture that one 
ultimately decides to use is a matter of convenience: the physics is the same. 
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II. HARMONIC OSCILLATOR: CREATION AND ANNIHILATION OPERATORS & COHERENT STATES 

Learning goals. After reading this chapter, the student should: 

• Be able to work with creation and annihilation operators and know their mathematical properties. 

• Know how to describe coherent states and why they are physically significant. 


In introductory QM courses, one learns about a wavemechanical treatment of the ID harmonic oscillator. Let us 
now use operator-algebra for the state vectors in Hilbert space to study the same problem in a simpler and more 
elegant manner. 


A. Creation and annihilation operators 


The Hamilton operator for a harmonic oscillator is known from introductory courses on QM, namely: 


H - k + 2 


It follows that the equation 


H 


huj 2mfrhj 2 ft 


P 


( 2 . 1 ) 


( 2 . 2 ) 


is dimensionless since [ftcc] = energy. Instead of q and p, we now introduce the dimensionless operators a and a t 
(dropping the superscript. ?. for brevity of notation) 


mu ^ 


2ft ^/2mfrw 


P, a 


t _ 


2ft H \[2mhujP 


(2.3) 


While q and p are Hermitian operators, we see that a and a t are not since a ^ a ). It is also useful to note the 
inverse relations 


q = 


/ ft”, + N . . mhuj, | v 

V2^ (a + a) ’ p = 1 V^ _(o o) - 


Keep in mind that [q,p\ = ift. It then follows from Eq. (2.4) that 

i muo 1-2 i f A A ^ H 1 

aa = w q + ^ p + Th {qp - pq) = i^- 2 - 

Similarly, one shows that aa 1 = H/frw + 1/2. Combining these results, one obtains 

[a, a/] = aa ^ — a^a = 1. 


(2.4) 


(2.5) 


(2.6) 


We have now found a very simple expression for H : 

H = hu(a ( a + -). (2.7) 

The next step is to find the eigenvalues of H. This amounts to finding the eigenvalues of N = a^ a, since H = 
(N + \)frio. The quantity N is known as the number operator , the reason being that the eigenvalues of N are 
positive integers. We will now prove this. The following relations will be useful in order to accomplish this task: 

[N, a] = a)aa — aa)a = (a)a — aa))a = —a, 

[N, al] = a)aa) — a)a)a = a) [aa) — a)a) = a). (2.8) 

For reasons that will become clear soon, a is known as the annihilation operator while a 1 is the creation operator. 
To identitfy the energy spectrum, let | n) be the orthonormal eigenvectors for H with eigenvalues E n , so that 
H\n) = E n \n). To find E n , let us start by examining a\n). Using the above relations, we find that 

Ha\n) = aH\n) — hu;a\n) = ( E n — Huj)a\n). (2.9) 
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We thus conclude that as long as a\n) ^ 0, a\n) is an eigenvector of H with eigenvalue E n — Jtlj. This argument 
can be generalized: a 2 \ n) has eigenvalue E n — 2 hw and so forth. This cannot continue forever, however, since the 
energy of a harmonic oscillator cannot be negative. To see this, recall that the norm of a vector is always > 0, and 
we have that: 


||a|n)|| 2 = (n\a) a\n) = {n \A - 1| n) = ( 2 - 10 ) 

Therefore, we must have E n > fwj/2. In order to guarantee that this is the case, there must exist a final eigenvector 
|0) so that a|0) = 0. The belonging energy to the state |0) must be the lowest energy available, so that 

H\0) /ia;(a t a+ ^)|0) = E 0 = (2.11) 

Now, since we could reach this state from any higher-energy state by moving downwards with energy steps of Ejj , 
we conclude that the general eigenvalues must be 


E n = (n+^)hw. (2.12) 

This is consistent with the known result derived in a more complicated way in introductory courses of QM, but we 
managed to find it in a quite simple and elegant manner using the general formulation of QM. 

Let us then turn to the eigenvectors. First, note that since H — (N + \)Ejj and E n = (n + \)Ejj, it follows that 
N\n) = n\n). The eigenvalue of N thus denotes by how many energy quanta Ej that the energy of the system 
exceeds the ground-state (lowest energy). We have that Ha\n) = (n — \)Eja\n ). But since (n — is the 
eigenvalue of the state \n — 1), we must have |n — 1) = c n a\n). Here, c n is a constant which we can determine 
through normalization: 


1 = ( n - l\ n - 1) = \c n \ 2 {n\(Ja\n) = |c n | 2 (n|7V|n) = \c n \ 2 n. (2.13) 

Therefore, c n — e l6 / yjn where S G 3ft. We set 5 = 0 for now and thus obtain the central result 

a\n) = y/n\n — 1). (2.14) 

However, if time-dependence is included in the notation | n) for stationary states, then 

| n) cx e ~ iEnt / h = e - i ( n+1 / 2 ) u;t (2.15) 

which means that S becomes time-dependent: 

a\n) = e~ lujt ^/n\n — 1). (2.16) 

To find aJ\n), we operate on the above equation on both sides with a t to find 

e~ lujt y/na^\n — 1) = a^a|n) = N\n) = n|n), (2.17) 

which after rearranging the equation produces 

a^\n) = e iut yjn + 1| n + 1). (2.18) 


This time-dependence is disregarded in the rest of this section, which means we set t — 0. Summarizing so far, we 
have then found the following two fundamental relations regarding how annihilation and creation operators act: 

a\n) — y/n\n — 1), a)\n) — \fn + 1 |n + 1). 

We can finally understand why a t is referred to as a creation operator, since its effect is | n) \n + 1) (creates one 
quantum of energy). In the same way, a is the annihilation operator since | n) \n — 1). Any excited state | n) can 
thus be obtained by acting on the ground state |0) n times with a) \ 

I n) = -L( at )”l°)- ( 2 - 19 ) 

Vn\ 
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Example 2. Computation of the expectation value of the potential energy of a harmonic oscillator in the 
state | n). We know that 

V = -muj 2 q 2 = -mu 2 ^ (a + a^) 2 . (2.20) 

2 2 2mu v 

Inserted into (n\V\n), we obtain 

(n|-faj(aa + ao) + a^a + aV)|n) = (n\-huj(aa^ + a^a)|n) 

= + l\/n + 1 + y/ny/n) 

= + 1/2) = £ n /2. (2.21) 

The average potential energy is thus equal to the average kinetic energy, namely 50% of the total energy in state 
\n). Note that, in comparison, if we wanted to compute (q 2 ) in the position representation, it would have been 
necessary to evluate an integral with the square of a Hermite-polynomial, which is a much more difficult task! 


For completeness, let us show how the position representation wavefunctions are recovered from the eigenstates 
177/) = —L(af) n |0). We know that the starting point to find i^> n {q), where q is the position coordinate, is ^ n {q) — 

(q\n). Using the completeness relation f dq'\q')(q'\ = 1, we obtain 


(q\n) = ;J= j dq l (q\(a i ) n \q')(q'\0). 


( 2 . 22 ) 


First, we evaluate 




Inserting this into Eq. 



(2.23) 


(2.24) 


We see that the n-th wavefunction 'ipn = (q\n) is expressed via ipo (q) = (g|0). We determine t/>o (q) by the criterion 
that defined |0), namely a|0) = 0. Projected onto \q), we get: 


In turn, this yields 


(<?l°|0) = J dq' (q\a\q')(q'\0) = 0. 

(,|«|0) - o - /<¥(« l/lU 

( Imuj ^ h d \ . 

= y\^h q+ 7^dq) m ' 

This means that we have obtained the following differential equation for the scalar (q |0): 

d . . . mu . . . 

- -- r «(«io>- 

It can be readily solved to yield ln(#|0) = —^q 2 + C where C is a constant. Therefore, 

<g|0> = e c e- mw « 2 / 2fi = 

V 7 rh J 


(2.25) 


(2.26) 


(2.27) 


(2.28) 
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where we determined the constant through normalization. Now, by inserting this back into our expression for 
ipn(q) we get: 


^Pn 


ma V /4 1 ( r d Y c -xV 2 

nhJ dx) 


(2.29) 


where x = q^/muj/h, which is the correct result for the position representation wavefunction. 


We mention in passing that we can now also identify the matrix representation for the operators by using the energy 
eigenvectors as a basis. From a\n) m y / n\n — 1) and a)\n) m %/n + 1| n + 1), we see that 



o 

o 

o 


" 0 0 0 0 ..." 


o 

o 

o 


o 

o 

o 

a = 

• o o 

• o o 

• o o 

: 

, a f = 

0 V2 0 0 ... 

0 0 a/3 0 ... 


B. Coherent states 

The eigenstates of the annihilation operator a, |a), are known as coherent states: 

a\a) oc |a). (2.31) 

The reason for this is that the time-evolution of such a state does not cause the state to spatially "diffuse" and 
become delocalized. Instead, the state’s spatial distribution oscillates with a preserved width of the oscillation as 
we now shall prove. 

We expand the eigenstate of the operator a in energy eigenstates: 

oo 

\a) = Y J C n \n). (2.32) 

n —0 

We showed previously that: 

oo 

a\a) = e~ lut c n \fn\n — 1). (2.33) 

n —0 

When c n yfn = ac n _i, where a is a constant, a\a) becomes proportional to \a). Using this relation, we have 
c n = coa n / \fn\, so that 


\ a i \ 

*} = c o > -/= \n) « 
Vn! 


n=0 


,-|«| 2 /2 


\ Ct 


n =0 


(2.34) 


We have chosen co so that (a|a) == 1 by using that X]^Lo( a * a ) n / n - = e ' a ' 2 - These states then satisfy a\a) = 
e~ luJt a\a) and the expectation values for a and a) are: 

(a|a|a) = ae~ lujt , (ala^o:)* = cx*e ia;t . (2.35) 


It remains to justify why we have said that these are known as coherent states. In order to see this, we consider 
how these states behave spatially. Since q = yJh/2mnuo(a + a^), we can show that: 


IteHl 2 


/tTTXU rnuj [q_q 0 C os {ujt—0)] 2 /h 

\ irh 


(2.36) 


where a = |aje 10 . The meaning of this inner product is the distribution of the spatial position, which is seen to 
describe an oscillating wavepacket which maintains a constant width as time evolves, hence the name coherent 
state. 
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III. TIME-INDEPENDENT APPROXIMATE METHODS 

Learning goals. After reading this chapter, the student should: 

• Know the fundamental idea behind non-degenerate and degenerate perturbation theory, when they are valid, 
and be able to mathematically outline how to apply them on a quantum mechanical problem.. 

• Know the fundamental idea behind the variational method, when it is valid, and be able to mathematically 
outline how to apply it on a quantum mechanical problem. 

• Know the fundamental idea behind the WKB-approximation, when it is valid, and be able to mathematically 
outline how to apply it on a quantum mechanical problem. 


Only rarely is a QM problem exactly solvable. Thus, having a "toolbox" of useful approximative methods is 
indispensible for a physicist. In this chapter, we will establish precisely such a toolbox. 


A. Non-degenerate perturbation theory 

Assume that Hq corresponds to an exactly solvable problem. Often times, a physical system may be described by 
a H which only slightly deviates from Hq. Then, H — Hq is the perturbation of the system. Assume E® and | n) 
are known for Hq 


H 0 \n) = £». (3.1) 

We want to find eigenvalues and eigenstates for the perturbed Hamilton operator H = H 0 + \H±, where Hi is time 
independent. Here, A is an expansion parameter which is assumed to be small. This kind of perturbation theory 
is suitable and commonly used in the context of atomic energy levels influenced by E or B fields. We start by 
assuming that the unperturbed energy level is non-degenerate. The exact eigenvalue problem can then be written 
as: 


(#o + A#i-S„)|V’n) = 0. (3.2) 

We now expand the eigenvalues and eigenstates in corrections to the unperturbed solutions: 

E n = E 0 n + \E^+\ 2 E^ + ..., 

\i>n) = l« (0) ) + A|n (1) ) + ... (3.3) 


and thus obtain 


(Hq + A Hi -E%- A E& - .. .)(|n (0) ) + A|nW) + ...) = 0. (3.4) 

For brevity of notation, we use | n) = \n (°)) in what follows. If this is to be valid for all A, the equation must be 
fulfilled for each power of A. We obtain to O (A 0 ): 

(H 0 — E^)\n) = 0, (3.5) 


while to order G( A 1 ): 


(Hq - £< 0) )|ra (1) > + (Hi - EW)\n) = 0 (3.6) 

and finally to order G( A 2 ): 

(H 0 - E^)\nW) + (Hi - E^)\n^) - E^\n) = 0. (3.7) 

The 0th order equation is known to be valid from the outset, since it corresponds to the exact unperturbed problem. 
If we multiply the 1st order equation (n\ from the left we obtain: 

(n\H 0 - E° n \n «> + <n|ffi|n) = e£\ (3.8) 

The first term is zero since it is equal to (n^ \Hq — E^n)* — 0. Therefore, we obtain 
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XE^ = (n|Atfi|n). 

This is the lowest order energy correction. It can also be written explicitly as 

A EM = j^YXH^dr (3.9) 

What about the correction to the eigenstates? Multiply the (9(A) equation with (m| where m^nto obtain 

(m \H 0 -E^\n^) + (m\Hi\n) = 0. (3.10) 

Defining E ( - = E^' 1 and using that 

(m\H 0 - E° n \n «) = (n^\H 0 - E° n \m )* 

= (E° m -E° n )(m\n «), (3.11) 

we obtain 

( 3 - 12 ) 

It is now clear why a problem would arise if the unperturbed eigenvalues were degenerate, since the denominator 
would beome zero then. By finally expanding Itt/ 1 )) in the unperturbed eigenstates 


rn 


via the completeness relation, we end up with the first order correction to the eigenstates: 


i n(1) > = 5 ? 

m^n 


(m\Hi\n) 

K-E° m 


\m). 


(3.13) 


(3.14) 


We have now determined the eigenvalues and eigenstates up to (D (\ l ). For some applications, it turns out that 
(n\Hi\n) = 0, which means that we have to go to second order in A to find the first non-vanishing correction. 
Following a similar procedure as in the first order case, one obtains for the eigenvalues 


E„ 


S° + (n|Aff 1 |n}+ Y, 

m^n 


IHAgiNI 2 

E^-E^ 


(3.15) 


Note that if we are perturbing the ground state, then E ^ > E®, which means that E will always be negative. 
Moreover, the above expression gives a criterion for the applicability of this method, namely that 

|(ra|AiT 1 |n)| « \E% — E^\ (3.16) 

so that the correction to E n is indeed small as assumed. This type of approximation theory is known as Rayleigh- 
Schrodinger perturbation theory. 


Example 3. Relativistic correction to the Coulomb-levels. Even if the levels above the ground-state have a 
degeneracy, we can still use our approximation theory because the perturbation matrix elements between degen¬ 
erate states, (m\XHi\n), turn out to vanish. The relativistic expression for kinetic energy can be expandaed in 
momentum as follows: 

— me 2 = me 2 — me 2 (3.17) 

and when assuming that |p| <C me, we obtain 

2 4 

m(?\/ 1 + p 2 /m 2 c 2 — me 2 ~ —- ^ + ... (3.18) 

' 2m Sm 6 c z 
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Here, m is the rest mass of the particle. The perturbation is then: 


XHi = -- 


n 4 


:V 4 . 


8m 3 c 2 

We know that the first order correction to the energy eigenvalue is the expectation value: 

n 4 


XE^ = (n\XHi\n) = -Z— j l^nfdr. 


(3.19) 


(3.20) 


This result was obtained by performing two partial integrations. The integral is most easily evaluated by using the 
SE for the Coulomb-potential which reads 


2 m 


V 'Ipnlm 


Ze 2 


'finlm T -^n^n/rn 


4ne 0 (r 2<m 2 ) Vw 


Inserted into our expression Eq. (3.20), we obtain 



(3.21) 


(3.22) 


This expectation value may be computed by using the known form of the hydrogen wavefunction. Introducing the 
fine-structure constant a = e 2 /An eohc, we obtain 


E n i = me 2 



Z 2 a 2 
2 n 2 


Z^oi 4 / n 3\i 
n 4 V2/ + 1 _ 8/J' 


(3.23) 


The term oc Z 4 is the lowest order relativistic correction to the energy level. Importantly, the energy level is 
now not only dependent on n, but also on the angular momentum quantum number /. This means that the energy 
spectrum has acquired a fine-structure. This result is correct for a spinless particle. The result is slightly modified 
for e.g. an electron that has spin 1/2. 
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B. Degenerate perturbation theory 

Consider now the case where the eigenvalue E® for the unperturbed Hamilton operator H° is degenerate. Let there 
be g orthonormal states |ni), \n 2 ), ... with eigenvalue E®. An example is the four states |200), |211), |210), |21 — 
1) corresponding to the first excited level for the Coulomb potential. We again expand in powers of A. For the state 
vector: 


9 

KU = ^o r |n r ) + A|V 1 ) + ... (3.24) 

r=l 

with so far unknown coefficients a r . To first order in A, the time independent SE gives: 

9 

(Ho ~ K)Wi) + (Hx ~ Eg*) J2“r\nr) = 0. (3.25) 

r =1 


Multiplying from the left with one of the unperturbed states (n s |: 

9 

Y J (n s \H 1 -E^\n r )a r = Q 

r= 1 


(3.26) 


where we used that (Ni\H 0 — E^\n s )* = 0. In Eq. (3.26), all the matrix elements are known: 

{n s \Hi\n r ) = H' sr . (3.27) 

They are computed via the unperturbed states. Since (n s \n r ) = 6 sr , we get 

9 

~ E^5 sr )a r = 0, s =1,2,... g. (3.28) 

r=1 


This is in fact a homogeneous set of equations for the unknown a r : 


TTf Z7»( 1) 

el ii — xL/n 

H[ 2 

to -4. 

H' 2 2 - E { , 

. H' g x 

H ' 3 2 


K 

H 2 3 


H’ 99 - E 


( 1 ) 



~aE 


a 2 


- a g- 


= 0. 


(3.29) 


This only has a non-trivial solution for the coefficients {a r } when det(M) = 0 where M is the matrix in the 
above equation. This gives an equation of the g-th degree for En \ If all g solutions for are different, it 
means that the perturbation XHi has completely lifted the degeneracy of the energy level and split it into g levels. 
This method for degenerate levels can and should be used on a level which is not exactly degenerate, but nearly 
degenerate, so that the criterion \(m\XHi\n) <C \E® — E^\ is not satisfied. Here, | n) and | m) are unperturbed 
eigenstates of the Hamiltonian. 


As an application of this framework, we consider the Stark-effect: the displacement of energy levels due to an 
external constant electric field E. Choosing z as the direction of the field, we get \R\ — eEz where E — Ez. The 
perturbation is thus the potential energy for a charged particle in an electric field. Assume that the particle is an 
electron in a Coulomb potential and that the field is so weak that perturbation theory is permissible. Let the energy 
states in the Coulomb potential be denoted | nlm). The ground state 1100) is non-degenerate, and the correction to 
the ground state energy E\ becomes 

A e[ 1] = e£(100|z|100) = eS J z\tp w0 \ 2 dr. (3.30) 

However, this integral is zero due to symmetry since ^100 oc e -r /°. Therefore, the lowest order non-vanishing 
correction to the ground state is 2nd order in the perturbation (the field E): 

Ei = Ei — constant x E 2 . (3.31) 

The constant may be evaluated using our formula for the 2nd order correction and one finds: 

+ (3.32) 
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where we emphasize that E® < 0. The correction to the ground state is atypical, because it is quadratic in the 
perturbation. For all other levels, the Stark effect is actually linear in the field S — \S\, and the reason for 
this is that all excited energy levels are degenerate. Consider for instance the n = 2 level which has a 4-fold 
degeneracy: |200), |210),|211),|21 — 1). To do perturbation theory for a degenerate level, we need the matrix 
elements e£(2lm\z\2l'm'). Several of these vanish: 

• Diagonal elements (2lm\z\2lm) are zero due to symmetry, just like the first-order term for the ground state. 

• All elements where m / m! vanish, the reason being that z — r cos <9 does not contain (/>, while ex 
e i m ^. As a result, the (^-integration gives 

/ e - iro ^e im '^ = 2 tt<W (3.33) 

Jo 


In effect, we fortunately only need to evaluate the matrix elements (210|z|200) and (200|z|210) = (210|z|200)*. 
Therefore, it suffices to compute 

e£(210|z|200) = e£ f ^ 210 ^ 200 ^. (3.34) 

The wavefunctions in the integral can be derived or looked up in a table and we simply write the result here: 

^210 = (327rao) _1/2 rao 1 e -r / 2a ° cos#, ^200 = (327rao) _1 / 2 (2 - ra 0 ' 1 )e _r / 2ao . (3.35) 

Inserted into the integral one obtains 

e£’(210|^|200) = -3eSa 0 . (3.36) 


The determinant that provides us with the first order energy correction is then: 


-E ( 2 ] -3ea 0 e 0 0 

—3 ea 0 £ -E^> 0 0 

0 0 -E^> 0 

0 0 0 -E ( 2 1] 


(E^fKE^) 2 - (3ea o £) 2 ]=0. 


(3.37) 


The solutions are E 2 ] = 0,0, ±3eao£. We thus see that the degeneracy is not completely lifted: the field splits 
the n — 2 level into three levels instead of four as shown in the figure. 


£>0 


E\ + 3 ea 0 £ + G(S 2 ) 


E = 0 


E *2 + 0(£ 2 ) (degenerate) 


' - E° - 3ea 0 S + G{£ 2 ) 

We here assumed that the field S is weak in order to use perturbation theory, but what does weak mean quantita¬ 
tively? Let us compare the energy splitting due to the field with the distance to the next unperturbed energy level 
which is 

E° - E° 2 = (t - I)e°| = 1.89 eV. (3.38) 

The ratio between the field-splitting of the levels and the above energy gap is then 

3ea ° g =_ L _ (3 39) 

El~E\ 1.2 x 10 10 V/m ^ ; 

We may conclude that our approach is valid so long as S = \S\ <C 10 10 V/m, which is an extremely large electric 
field. Finally, it is instructive to consider the state belonging to the lowest energy level E^ = —3eaoS. The state 
is specified by computing the {aj} coefficients in our previous derivation and one finds that 

IlM = ^(1200) - |210)). (3.40) 

This state has a finite dipole-moment along the z-axis, namely 

d — (-0_| — ez\i\)E) — 3aoe. (3.41) 

Therefore, the physical meaning of the energy shift due to the electric field is that it represents the dipole-energy 
-d£. 
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C. Variational method 

There are problems where one cannot split H into an exactly solvable part and a small perturbation. In such events, 
perturbation theory is not applicable and we may instead employ a so called variational method. This method is 
particularly useful to determine the lowest-lying eigenvalue Eq . It is based on the fact that the expectation value 
H in any state |/) must be > Eq: 


(mil 

</!/> 


> Eq. 


We prove this result using the wavemechanics formulation of QM. First, expand the states / in the eigenfunctions of 
H, so that / = c n ^ n . Using the orthonormality of the set {^? n }, we obtain f f*fdr = l c n| 2 - Therefore: 

j f*Hfdr = Y J \cn\ 2 E n . (3.42) 

** n 

Since E n > Eq per definition, we obtain 

J tHfdr >E 0 J1 M 2 =E 0 J f* fdr, (3.43) 


which completes the proof. The equality sign is obtained only if the state / actually is the ground state t/’o • In other 
words, E 0 is obtained by minimizing the functional E[f] with respect to the function / where 


m = 


iFHfdr 

ff*fdr 


(3.44) 


The variational method then consists of selecting trial functions / that depend on one or more parameters, comput¬ 
ing E[f], and then minimizing it w.r.t. /. The result will be an upper limit for Eq, and the lowest value obtained 
will always be the best. To be successful, one should ideally try to guess on a trial function form / which seems 
physically reasonable for the system. 



Alcatel-Lucent 


www.alcatel-lucent.com/careers 


What if 
you could 
build your 
future and 
create the 
future? 


One generation’s transformation is the next’s status quo. 
In the near future, people may soon think it’s strange that 
devices ever had to be “plugged in.” To obtain that status, there 

needs to be “The Shift”. 


Download free eBooks at bookboon.com 



25 













INTERMEDIATE QUANTUM MECHANICS 


TIME-INDEPENDENT APPROXIMATE METHODS 


Example 4. Triangular well. We use the variational method to estimate E 0 for a triangular well, where V(z) = oo 
for z < 0 and V(z) = Fz for z > 0. 



that the force is F = e£. This is a commonly encountered situation in experimental electronics when one wants 
to create artificial 2D electron systems. What kind of wavefunction should we expect in this system? It should be 
zero upon entering the V = oo region (and in the region itself) and also fall off as z increases. Thus, a possible 
choice which satisfies this is f(z) = ze~ az / 2 for z > 0. We then obtain 


m = 


/o°°(-ft 2 / 2m )//" rf2: + -Fjp 00 zpdz 


/o°° f 2 dz 

All integrals may be evaluated analytically, and in total one obtains 

_ ft 2 a 2 3 F 

E[f] -2^T + ^' 


(3.45) 


(3.46) 


Here, a is the free parameter that we may adjust in order to obtain as good a guess as possible for Eq. In effect, 
we want to minimize E[f] with respect to a. Setting dE[f]/da = 0 gives 


(12mF\ 1 / 3 

“ - (-^-) ' (3 ' 47) 

The corresponding minimum value of E[f] for our particular trial function is then 2.48(/i 2 /2ra) 1 / 3 F 2 / 3 for that 
choice of a. Now, we don’t know how good this result is, i.e. how far away from the true ground state energy it 
is. In this particular case, however, we are lucky because the triangular well problem can actually be solved exactly. 


To see this, consider the SE for z > 0: 

h 2 d 2 ib 

- — ^+Fz^ = E ^. (3.48) 

Now, introduce the quantities k = (h 2 /2mF) 1 / 3 and x = z/k in order to bring the equation to dimensionless 
form: 


d V 

dx 2 


{x - E)i/j = 0, 


(3.49) 


where we defined E = E/Fk. The key observation here is that the equation y" — xy = 0 is Airy’s differential 
equation, which has two known independent solutions: y = Ai(x) and y = Bi(x). While Bi(a?) diverges for large 
x , and thus is physically unacceptable in our system, Ai(x — E) has an acceptable behavior as it decreases for 
x — E > 0. The physically acceptable solution to the SE for this system thus has to be ^{x) = Ai(x — E) where 
the definition is: 

1 f°° 

Ai(x) = — / cos(xz + z 3 /3)dz. (3.50) 

ft J o 


Since -0(0) = 0 due to the infinite wall potential, we obtain the energy eigenvalues from Ai(— E) = 0. The smallest 
value of E must be the ground state, which is found numerically to occur at E = 2.33811. Since E = E/Fk , we 
get 


E 0 = 2.33811(ft 2 /2m) 1/3 F 2/3 . 


(3.51) 


Comparing with the result we obtained using the variational method, we see now that it was quite good: only 6% 
deviation from the exact result! 
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The variational method can also be used for the lowest-lying excited level E \, granted that we can choose a trial 
function that is orthogonal to the ground state. To see this, expand / = c n^n, thus excluding n — 0 since 

/ has to be orthogonal to t/’o- It follows, proceeding as we did before, that for this / we have 


(/I/I) 


>El 


(3.52) 


Symmetry can be used as a guideline to ensure orthogonality between / and ^o- For instance, in a ID symmetric 
potential, the first excited state is antisymmetric while the ground state is symmetric. If symmetry arguments are 
not available, another option is to compute the ground state as accurately as we can, and then ensure that / is 
orthogonal to that function. 


D. WKB approximation 

Whereas the variatonal method is useful for approximating the ground state of a system, it is useless for the 
purpose of determining highly excited states. In contrast, the WKB-method (Wentzel, Kramers, Brillouin) is 
particularly accurate for highly excited states, and also quite accurate for lower states. This method is also known 
as a semiclassical approach and the key idea behind is to assume that the potential varies slowly in space (we will 
later specify what this means quantitatively, i.e. how slowly it must vary). 

To outline the strategy behind the WKB approximation, consider the ID SE with a general potential V (x) 

rft 9 777 

V ) + —[E- V(x))i,(x) = 0. (3.53) 

We now try to solve this using the ansatz ^(x) = Q lS( ^ x )/ h , For V(x) = Vo, this is indeed an exact solution with 

S(x) = ±^/2m(E -V)x. (3.54) 

We may thus view ^{x) as a wavefunction with variable wavelength. Inserting it into the SE gives the following 
equation for S: 


(S') 2 - 2m[E - V(x)] - ihS" = 0. (3.55) 

If V(x) = Vo, then S" = 0. Thus, if the potential is slowly varying, it seems reasonable to solve Eq. (3.55) iter¬ 
atively while treating the term i HS" as a small perturbation. Let us use H as a book-keeping expansion parameter, 
similarly to what we did with A in previous perturbation theory. We expand 

S(x) = So{x) + HSi(x) + h 2 S 2 (x) + ... (3.56) 

Inserting this expansion into Eq. (3.55), we first collect the O(h 0 ) terms: 

(S' 0 ) 2 m 2m[E-V(x)\. (3.57) 


The solution of this equation is 

So(x) = ± j a/ 2 m[E - V(y)\dy + ci >± . 
7 x 0 

Here, ci t ± is a constant. Next, the O(h) terms provide the equation 


Integration gives: 


2 S’ 0 S[ = iSq S[(x) = 


1 si 

2 Si,’ 


s i( x ) = ^ln5o(a;) + c 2 ,± 


where C 2 ,± is a new integration constant. Since we now have identified So and S i, we find that 

_ e iS(x)/h ^ e i(So+hS!)/h^ 


(3.58) 


(3.59) 


(3.60) 


(3.61) 
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and by renaming the constants to A± = e 1Cl ± e 1C2 ± we thus obtain 


4>(x) 


^2m[E-V(y)]dy 

[E — V(x)] 1 / 4 


(3.62) 


This is the WKB approximation for the solution ip(x). Note that if V(x) > E (classically forbidden area), 'ip(x) 
exponentially increases or decreases since the exponent becomes purely real: 


m t ) = B± r ± l 

n> [V(x)-E]i/* 


(3.63) 


where we absorbed some numerical constants [(—l) 1 / 4 ] into A± and renamed it to B±. For a bound state, one of 
the coefficients B± thus has to be zero in order to prevent vp(x) from diverging. 


When is WKB valid? 

The premise of our approach is that the term i hS" is small compared to 2m[E — V (x)] due to a slowly varying 
potential. Letp 2 = 2m [E — V (x)]. In effect, we demand that 

|i hS"\ < \p 2 \. (3.64) 


If S" is small, it means that we may approximate 

(S') 2 - 2m[E - V(x)} - i HS" ~ (S') 2 - p 2 = 0. (3.65) 

Therefore, S' = p S" = p '. This gives us 

|i h dp/dx\ <C |p 2 | | <C 1. (3.66) 

Since h/p = A is the wavelength of the particle, this means that 



Physically, this means that the change in wavelength A over a distance A should be small compared to A itself in 
order for the WKB treatment to be valid, which can be satisfied by a slowly varying potential. 


Application #1: quantization with hard walls. 

Consider a potential containing two hard walls at x = xy and x = xh , so that V(x) = oo for x < xy and 

x > xh • 



Since the wavefunction is zero outside xy < x < xh, we must have ^(xy) = 'ip(xn) = 0. We then need a 

linear combination of the solutions 'ip(x) = [e-v^x)] 1 / 4 e±H ^ x ° ^ 2m ^ E ~ v ^ dy which vanishes at those points. 
One combination that satisfies this is 


if we demand that 


'ip(x) = A[E — V(x)\ 1 / 4 sin[(l /h) 



V 2m[E -V(y)]dy] 


(3.67) 


1 

h 



\/2m[E — V(y)]dy = nn, n — integer. 


(3.68) 
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This is effectively a quantization condition for the energy, which may be written as (using 1 /h = 2n/h) 

rXH 

2 / yj2m[E n - V(y)\dy = nh. (3.69) 

J XV 

In the simple limit that V(y) = 0, we obtain 2y/2mE n (xH — xy) — nh , which is an exact result. 

Application #2: quantization with continuous potential. 

Consider a continuously varying potential with a minimum. 


V(x) 



We should then expect to have oscillating WKB-solutions for xy < x < xh, but decaying solutions for x < xy 
and x > xh since those areas are classically forbidden. A problem nevertheless arises at the points where E = 
V(x) since the WKB-solution diverges there due to the factor [E — V (x)] -1 / 4 . The challenge is then: how do we 
connect the inner solutions (xy < x < xh) 

= [E- V{x)}~ x ' a [A + ^ f *v y/*np-v(v)1dv + A _ e ~if: v V* ™[E-v( v )\d V ^ (3 . 7 o) 

with the outer solutions 


^(x) — B-[V(x) — E] 1 / 4 e a ^ x v \/ 2rn \y(y) E ^ d v ^ x < xy, 

ip(x) = B+[V(x) - x> x H . (3.71) 
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Our strategy will be to treat the areas close to x = xy and x = xh exactly, since the potential can then be 
approximated as linear (via e.g. a Taylor expansion), and then use this exact solution to connect the inner and outer 
solutions. Close to the right turning point x = xh, we have V(x) — E ~ c(x — xh) where c > 0. The SE for this 
linear potential reads 


dp'll) 2me , 

-*»>'/• = » 

and is equivalent to the previously mentioned Airy’s differential equation 

d 2 ip t . 

where we introduced £ = ( J (x — xh)- We have previously looked at the solutions Ai(x) and Bi(x) and 
the non-divergent solution Ai(£) has the following asymptic behavior: 


(3.72) 


(3.73) 


a = j 2 /^ 1/4(3 73 for large positive £ 

t^(-0 _1/4 cos[|(-^) 3 / 2 - f] for large negative £. 

If we use the linear potential c(x — xh) in the WKB wavefunction, then for x < xh we obtain 


(3.74) 


l j* H VME - V(y)]dy = |(2 mc/h 2 )^ 2 ( XH - xf' 2 = |(-0 3/2 - (3.75) 


We then see that we can write the asymptotic Ai(£) function as 

<2mc\ 1 /^, .i -1 / 4 

\W) 

for large negative £. Since E — V(x) oc (x — xh), we see that 

rXH 


Ai ( o =— 


-1—1/4 r 1 r X H _ 

(x - x H ) J cos 1^— J \/2m[E -V(y)\dy - -J (3.76) 


r 1 C XH 

Ai(0 oc [E-V(x)]-V*cob [j_J a/2 m[E - V{y))dy ■ 


(3.77) 


and this is precisely the WKB wavefunction for suitably chosen coefficients A±. In other words, by choosing A± 
so that the WKB wavefunction becomes the asymptotic part of Ai(Q, we may then connect the inner wavefunction 
to the outer one for x > xh- Performing the same procedure at the left interface gives us 


rl f x 

i!>{x) oc [E- V{x)]- l/A cos - / y/2m[E - V{y)]dy ■ 

^ ■! XV 


(3.78) 


We now have two expressions for the inner wavefunction which should be equal for consistency. Using that 

rX _ rX H 

Jxv Jxv 


/ *x rXH r x H 

= — , we can write 

Xv Jxv Jx ’ 


rl f X H _ 1 f X H _ 

ip(x) oc [E - y(x)] _1/4 cos U / y/2m[E - V(y)]dy - - / y/2m[E -V {y)]dy + - J. (3.79) 

J X J Xv 


For the two wavefunctions to be equal, we thus obtain the criterion that 


1 

h 



a/2to[.E -V(y)]dy = nn-^. 


(3.80) 


Note that we have used here that the wavefunctions only need to be equal up to an overall sign ±1 since this 
sign can be taken care of by the normalization factor. In the above equation, n is an integer. Therefore, it can be 
rewritten as 


2 [ H V2 m[E - V(y)]dy = (n - \)h, n = 1,2,3,... (3.81) 

J Xv ^ 

The energies E satisfying this equation then determines the energy eigenvalues E — E n . Since the classical 
energy-momentum relation is E = p 2 /2m + V, we can write the above result as 


j) p(x)dx — (n + ~)/i, n — 0,1, 2,... 


(3.82) 
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where the integral is taken over one period of the classical motion (starting and ending up with the same 
momentum). This is the Bohr-Sommerfeld quantization condition. The quantization condition Eq. (3.81) gives 
better results the larger n is, but decent results may also be obtained for lower-lying levels n as well. For a 
harmonic oscillator, Eq. (3.81) in fact gives the exact eigenvalues for all n. 

We previously treated the case with two hard walls. If the potential instead has one hard wall, e.g. at x = xy, 
the wavefunction must vanish at x — xy. From our expression for the inner wavefunction obtained from the 
asymptotic behavior at x = xh, we see that the quantization condition becomes: 2 a/ 2m[E — V(y)]dy = 
(n — \)h. It is then possible to summarize our WKB results for the energy eigenvalues in the presence of hard 
walls as follows: 

• 0 hard walls: 2 /*/ ^/2 m[E - V{yj\dy = (n - \)h. 

• 1 hard walls: 2 y/2m[E - V(y)]dy = (n - \)h. 

• 2 hard walls: 2 0 ^/2 m[E - V{y)\dy = (n - 0 )h. 


Example 5. Triangular well. Let us apply the WKB method to the triangular well problem to see how well it 
approximates the eigenvalues. We have V(x) = oo for z < 0 and V(z) — Fz for z > 0. This problem thus has 
one hard wall and to use the quantization condition we have to set zy = 0 and zh = E/F, since zh was assumed 
to be located at the classical turning point. We get: 

f E / F 7 _ 1 

2 / y/2m(E - Fz)dz = (n - ~)h, (3.83) 

Jo 4 

which solving for E provides E — E n — |7r(n — ( ^ ) F 2 ' 2, . The numerical coefficients for n = 

1,2,3 are respectively 0.8%, 0.15%, and 0.08% off the exact results! 
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IV. TIME-DEPENDENT APPROXIMATIVE METHODS 

Learning goals. After reading this chapter, the student should: 

• Know the fundamental idea behind time-dependent perturbation theory, when it is valid, and be able to 
mathematically outline how to apply it on a quantum mechanical problem. 

• Know the fundamental idea behind the sudden approximation, when it is valid, and be able to mathematically 
outline how to apply it on a quantum mechanical problem. 


So far, we have studied weak or slowly (spatially) varying perturbations of a QM system. Now, we take the step 
to time dependent perturbations. Important applications for such a framework include EM radiation, spectroscopy, 
and laser physics. 


A. Perturbation theory 

We start by considering the case of a weak perturbation (in magnitude) and set out to derive the differential equa¬ 
tions governing the state coefficients. Let V(r,t) be a weak time dependent perturbation: 

H(r,t) = H°(r) + V(r,t). (4.1) 

Assume that the stationary states (r, t) = ip n (r)e~ lEnt / h for the unperturbed system H°(r ) are known. We 
thus have H°(r)ip n = E n ip n . The time evolution of the time dependent, non-stationary states 'P are governed by 

ihdt'li = ff'P. (4.2) 

We are not able to solve this in its exact form, and thus look for a perturbation method valid for weak V. Since the 

eigenstates for the unperturbed system is, as usual, assumed to be a complete and orthonormal set, we may expand 

>P(r, t)=y^ a k (t)ip k (r)e~ lEkt/h . (4.3) 

k 

Note that the coefficients {a^} have to be time dependent. Due to the normalization of 4/(r, t), we obtain 

Ek(t)| 2 = l. (4.4) 

k 

Inserting the expansion Eq. (4.3) into the time dependent SE, we obtain 

- ^E k a k ^tp k (r)e~ lEkt/h = y^a fc [iT° + V(r,t)\ip k (r)e~ lEkt/h . (4.5) 

k k 

We know use that H°^k — Ek^k to cancel two terms in the above equations and then multiply it with [^(r)]* 
and integrate over space, in order to obtain 

ih^Ee~ iEnt/n = Vnk{t)z~ iEkt/h . (4.6) 

k 

We here defined 

V n k{t) = j [ifj n {r)]*V(r,t)tp k (r)dr = {n\V\k}. (4.7) 

This is a known quantity since it can be computed from the known i/j n and V. With the short-hand notation 
uj n k = ( E n — Ek)/h, we may then write the result as 

D V nk e'“^a k (t), n = 1, 2, 3,... (4.8) 

k 
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Note that so far, we have not actually made any approximations: this coupled set of equations for the state 
coefficients is fully equivalent to the SE. 


If we now, however, do assume that V is weak, a & will only have a weak time dependence and we can approximate 
the solution by neglecting the time dependence of ak(t) on the r.h.s. of the equation. Doing so and integrating with 
respect to t gives: 


o) T 


^ ^ ft 

— V'afc^o) / V nk (r)e 

1,1 , Jtn 


r dr. 


(4.9) 


If the system starts out in state b at t = to, then a^{to) = Skb and we obtain 


a 8 (t) = rzf V sb (T)e luJebT dT (s / b). (4.10) 

in Jt 0 

This is a key result because it tells us that the probability that the system at a time t has made a transition from 
state b to s is P b ^ s (t) = |a. s (i)| 2 . Note that P b _> b = 1 - J2 s ^b p b^s- 


Detailed balance. 

Let us compare the probability for the transitions b s and s —► b. The first one was calculated above. The 
second one is 





Vbs(r)e luJhsT dr. 


(4.11) 


Since Ub s = (Eb ~ E s )/h = —w s b and Vb s = (6|V"|s) = (s|V|6)* = V * b , the amplitudes satisfy a^ s (f) = 
—a* s ^ b (t). Taking | ... | 2 , we see that 


P s ^b(t) = IUs(t). 


(4.12) 


In other words, to first order in time dependent perturbation theory, the probability for a transition is equal to the 
probability for the opposite transition. This result is known as detailed balance. 


Transient perturbations. 

Assume that we are dealing with a perturbation that is transient, such as a charged particle passing by an atom 
and exciting the electrons in the atom. This is actually the dominant mechanism that causes deceleration of an 
individual charged particle injected into a material. Since the coefficients {a s } stop changing after the perturbation 
has ceased, we may set t = oo and use to = — oo as the initial time. The transition probability from state b to s 
then takes the form: 

i 1 r°° 12 

Pb^s = \j l J e^ sbr V sb (r)dT | . (4.13) 

In the special case where V varies slowly in time compared to the period c u~ b , the integrand oscillates rapidly 
around zero and the integral become very small. If instead the perturbation varies in the same way as the "eigen- 
frequency" uj s b of the system, a resonance can occur which strongly influences the system. We now proceed to 
consider such a scenario. 


B. Harmonic perturbations 

An important special case is when the perturbation varies harmonically: 

V(r,t) = F+(r)e iwt + y_(r)e“ iwt . (4.14) 

The interaction between an atomic system and a radiation field in the form of EM waves has this form. The limit 
lj 0 corresponds to a constant perturbation. In order for V to be Hermitian, we need V+ = V -. Inserting this V 
into our result for the transition coefficients, we obtain: 

&b^s (t) — 


4(F + ) sb f e^+^dr + Ev-U fe^-^dr 

JO M Jo 

1 _ pi (uj sb +uj)t 1 _ pi(w s b-w)t 

(''•)*'■ ..+ ('' U ,., • (4-15) 


Download free eBooks at bookboon.com 


33 




INTERMEDIATE QUANTUM MECHANICS 


TIME-DEPENDENT APPROXIMATIVE METHODS 


We have set to = 0 as the reference point. To obtain the transition probability we need |a s | 2 . This gives | ... | 2 of 
the individual terms in Eq. (4.15) and a cross-term. Consider the last term oc V_ which after | ... | 2 gives: 


A WV \ ,2 sin 2 [(Eg -E h - fiw)t/2h] 

4|(V - )s&l (E s -E b - M 2 ' (4 ' 16) 

This contribution has a peak (with a height oc t 2 ) at the energy E s = E^ + huo. The width of the peak, on the other 
hand, goes like t -1 . As a crude approximation, we may then write 


4|(^-U| 2 


sin 2 [(£(5 — E b — hw)t/2h\ 

(Es - E~b - JkJji 


\(V-) sb \ 2 —S(E s -E b - M- 


(4.17) 


The term oc V+ similarly gives a sharp maximum at E s = E\, — huj. The cross-term, however, has no sharp 
maximum and thus for large times t we have the following transition probability per unit time: 


_\a b ^ s (t)\ 2 

^b^-s — , 


— \(V-) sb \ 2 5(E s -E b -huj) + —\(V+) sb \ 2 5(E s -E b + Hu). 


(4.18) 


A sketch of the true behavior of the | a^ s | 2 would look like this: 




The formula for uJb^s is useful when the energy spectrum or frequencies are continuous so that E^ ± huo = E s 
can indeed be satisfied. 
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Example 6. Continuous uj. An EM field ( e.g . visible light, X-rays) with a broad spectum of frequencies causes 
transitions between discrete atomic or molecular states. The resulting absorption spectrum consists of sharp lines. 

Continuous E s . A laser beam with fixed frequency uj can ionize an atom, causing a transition from a discrete 
bound state to a state in the continuous spectrum. This is the photoelectric effect. 

Continuous E 5 and E s . A typical scattering experiment consists of particles in a beam being perturbed by some 
target (i.e. potential) and changing direction. Such scattering is a transition between different states in continuous 
spectra. 


Transition to continuum states. 

Assume that we start out with a state with fixed E 5 and that the perturbation has a specific uj, while the final state 
s lies in a continuum of final states. Let there be p{E)dE energy states in the range (E, E + dE), such that p{E) 
is the density of states (DOS). For instance, in previous QM courses you may have shown that the DOS for a free 
particle in a volume Vq is 


p{E) = 2n{2m/h 2 ) 3/2 V 0 E 1/2 . (4.19) 

We may then compute the total transition probability to a state with energy close to E s . This is obtained by 


= —\V sb \ 2 p(E s ). 

The formula expresses that the transition rate uj increases both with the "overlap" \V s b\ element between the states 
and the amount of available states p(E s ). This is known as the golden rule. If one is interested in only a subset of 
the states with energy E s , such as particles moving in a certain direction which a detector can pick up, one simply 
uses the DOS for that subset. For a free particle, it would be the fraction dfl/Air of the total DOS: 


where E — p 2 /2m. 


p = 27r(2m//z 2 ) 3 / 2 Vb^ 1 ^ 2 ^ 7r — ^3 


(4.20) 


We can apply this to a scattering scenario, where a scattering potential V(r) acts as a perturbation on a particle- 
beam -0(r) — -^e iPi ' r / h . The aim is to find the probability per unit time for a transition to the final state 

— ~^e ip f' r ^ h . Here, Vo is the volume under consideration. We may treat this process as stationary 
(corresponding to uj = 0 ) and the energy before and after is thus the same: \pf\ = | p { \ = p. 



Pi 


V'(r) 


To obtain uj^s via the golden rule, we need the matrix-element: 

V fl = V /e i(p i- p / ) ' r/?i l/(r)dr. 

Vo J 


It follows that 




27r 1 

Tv 2 


J e KPi~Pf)-r/hv( r )dr 


x Vo^dSl. 

h 6 


A common way to measure scattering is the scattering cross section da: 

number of particles scattered into dQ per unit time 


da = 


incident particle intensity 


(4.21) 


(4.22) 


(4.23) 
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Quantitatively, the nominator is and the incoming particle intensity is the product of the particle density 
|^| 2 SB 1 /Vo and the velocity p/m so that 


da — cUj. 


mV{] 


(4.24) 


With our expression for c we end up with 


da 

m o [ F(r)e i(p *- p / ) ' r/ ' l dr 

2 

dQ 

2irh 2 J y ’ 



Later, we will regain this result using a different method. The above formula is the so-called Bom approximation 
for the scattering cross section. In chapter 13, we will also examine the range of validity for this result. 


C. Sudden approximation 

Let us now consider a scenario where the magnitude of the perturbation is not necessarily weak, but where the 
disturbance is switched on very abruptly. The simplest scenario where one can envision this is where H changes 
abruptly from H 0 to Hi at t = 0, where Hq and Hi are both time independent in themselves. Thus, we have 

* < 0 : -ffoV’fc = E° k < (4-25) 

where ^ are orthonormal and form a complete set, which is not necessarily discrete. Moreover, 

* > 0 : J?1 4>l = El4> l n (4-26) 

where {</>*} are also orthonormal and complete. The general solution of the time dependent SE is then: 

t<0:'H(t) = Y / C 0 k ^ 0 ke~ iE ° kt/h , 

k 

= (4.27) 

n 

Assume that (t) is normalized to unity, so that c° and d ° are the usual probability coefficients for finding the 
system in state ip® and </>* at t < 0 and t > 0. Now, since the time dependent SE is first order in the time coordinate, 
it means that T'(t) must be a continuous function of t. Thus, at t = 0: 

= (4 - 28) 

k n 

Take the scalar product with (j)\\ 

d 1 n = ^2c° k ((p 1 n \ip 0 k ). (4.29) 

k 

We now have a way to obtain the probability coefficients after the sudden change at t = 0, given that {c°} are 
known. In practice, the change from H 0 to Hi will take place over a short time interval r rather than being 
instantaneous. The simplest way to approximate this scenario is to use Eq. (4.29), but how large can r while Eq. 
(4.29) remains useful? 


We derive a simple criterion of validity. Let: 


H 


H 0 for t < 0 
< Hi for 0 < t < r 
Hi for t > r 


(4.30) 


where Hi is the time independent Hamiltonian during the intermediate period r. If {x}} denotes the complete 
orthonormal set of eigenfunctions of Hi , so that HiX\ — E\x\, then the general solution for the state coefficients 
{ d determining the state at t > r can be found in the same way as above, namely by using continuity of the 
wavefunction at t = 0 and t = r. It yields: 

d n = EE c 2^nlxi><xll^>e i(s "- B ' ,)T/B . (4.31) 

k l 
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Compare this with Eq. (4.29) which was obtained for r = 0, i.e. instantaneous switch from Hq to Hi rather than 
through an intermediate Hamiltonian Hi. If we set r = 0 in Eq. (4.31), the equations are equivalent as expected. 
However, if r ^ 0, the difference arises because of e 1 ( E n~ E i) r / h not being unity. If the sudden approximation 
is to be valid, we thus need r to be small compared to all the inverse energy differences h/\E* — E\\ so that the 
exponential is close to unity: 


T«fi/|££-£?|. (4.32) 

An interesting special case of the sudden approximation is when the system initially (t < 0) is in a particular 
stationary state ^e~ lEat ^ h where ^ is an eigenstate of H 0 . Then, c 3 = Ska and the probability amplitude of 
finding the system in eigenstate of Hi after the sudden change in the Hamiltonian has occurred is simply 

4 = (4>n\i>a)- 


Example 7. Beta decay of the tritium nucleus. A tritium atom consists of a nuclear 3 H (one proton + two 
neutrons) and one electron. It is unstable and decays into the nucleus 3 He (two protons + one neutron): 

3 H 3 He + e _ + v e . (4.33) 

Assume that the tritium atom is in its ground state before the /3-decay of 3 H takes place. The question is now: 
what is the influence of the decay on the atomic electron? 

We first note that in the /3-decay process above, the electron is emitted from the nucleus with, in most cases, an 
energy of several keV. This means that its resulting velocity v is much higher than the velocity vq ~ c/137 of 
the atomic electron in the ground state of tritium. If ao is the Bohr radius, the emitted electron will leave the 
atom in a time r ~ ao/v. This is much shorter than the period T = Zttclo/vo associated with the motion of the 
atomic electron. Thus, we can justify a scenario where the nuclear charge "seen" by the atomic electron changes 
instantaneously from Ze to Z'e where Z = 1 and Z' = 2. The relevant Hamiltonians we have to work with are 
then: 


H(t<0) = H o = -^-V 2 

Zm 

H(t>0) = H 1 =-^-V 2 

Zm 

with m being the mass of the atomic electron. We neglected here the recoil effect on the nucleus, since its mass 
M m. The eigenfunctions of Hq and Hi are hydrogenic wavefunctions and thus known. Since the tritium atom 
is assumed to initially be in its ground state (quantum nunbers n = 1,/ = 0, ra = 0), the probability coefficients 
dn'i'm' °f finding the atomic electron in a discrete eigenstate ( n'l'm') of Hi at t > 0 is: 



d 


i 

n'l'm' 


TO = 


/( 


^J)( r )) ^ 


(Z=l) 

100 


( r)dr 


(4.35) 


where (r) is a hydrogenic wavefunction with atomic number Z. We know that i/nZm ( r ) = ^ni ; ( r )^/m (#, 4>), 

and from the orthonormality properties of Y/ rn one may verify that the only non-vanishing probability coefficients 
dn'i'm' are tfi° se belonging to the s-states (/' = m' = 0): 


POO 

4' 00 =/ R ( n % =2 \r)R[ Z 0 =1 \r)r 2 dr. (4.36) 

Jo 

For the particular case in! — 1, we obtain 

d\ oo = 2 7 /V 3 f dr r 2 e~ 3r / a ° = (4.37) 


Hence, the probability that the 3 He ion is found in its ground state is P/q 0 = \d\ 00 1 2 ~ 0.702. The total probability 
for the ion to be either excited or even ionized is then 1 — Pj q 0 — 0.298. 
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V. ADIABATIC APPROXIMATION AND THE BERRY PHASE 

Learning goals. After reading this chapter, the student should: 

• Know the fundamental idea behind the adiabatic approximation, when it is valid, and be able to mathemati¬ 
cally outline how to apply it on a quantum mechanical problem. 

• Be able to explain what the Berry phase is and in which scenario it is of relevance. The student should also 
be able to give concrete examples of systems where the Berry phase plays an important role. 


The lecture notes forming the basis for this chapter follow roughly the same structure as the corresponding chapters 
in "Quantum Mechanics" by Bransden & Joachain. 


A. The adiabatic approximation 

The perturbation method we have initially considered was based on the assumption that the magnitude of the time 
dependent part of H has been small. We now present a new approximation where the key parameter is the rate of 
change of H . Start by assuming that H varies very slowly with time, i.e. the completely opposite scenario of the 
sudden approximation. One should then expect that the approximate solution of i hd t ^ = can be obtained 

in terms of the eigenfunctions of the "instantaneous" Hamiltonian H(t) so that 

= (5.1) 

at any given time t. Physically, what we are stating here is that if H(t) changes very slowly, a system which at 
t = to is in a discrete non-degenerate state with energy E a (tf) is very likely to be in the state f> a (t) with 

energy E a (t) at a later time t, i.e. without making any transition. We now proceed to prove this adiabatic theorem , 
using the method of Born & Fock from 1928. 
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In fact, the original formulation of this theorem was: 


A physical system remains in its instantaneous eigenstate if a given perturbation is acting on it slowly enough and 
if there is a gap between the eigenvalue and the rest of the Hamiltonian’s spectrum. 


We can think of this physically as follows: if a QM system is exposed to a slowly changing perturbation, the 
system has time to adapt to the perturbation. In contrast, if the perturbation occurs very rapidly there is not 
sufficient time for the system to adapt, so that the probability density remains unaltered. 


Assume that is known at t = to- Now, expand for t > to in the instantaneous eigenfunctions 

f* 

/to 


V = 


y2c k {t)ip k {t)sxp\ - (i/ft) [ E k (t')dt ’']. 

L Jtn J 


(5.2) 


We assume that {'ipk} form an orthonormal and complete set, as usual. The energies Ek(t) are non-degenerate and 
form a discrete spectrum. Note that the "energy levels" is just a formal name, since energy is not strictly speaking 
conserved for a time dependent H. Inserting the above expansion into the time dependent SE provides: 


(c-fcV’fc + Ckdtipk - (i/n)c k 'tp k E k ^exp^ - (i/H) j E k (t')dt'^ = H(t) ^ c k ip k exp [ - (i ,/H) j E k (t')dt'^. 

k k 

(5.3) 

There is a cancellation of the last term on the l.h.s. by using Eq. (5.1). Now, do the following: 

• Multiply with ip b (t) (which is part of the set {^ (£)}). 

• Integrate over the coordinates of the system. 

• Use that (rpb\^k) = &bk- 


This gives: 


Cb(t) = -y^c fc (t) ex p{^ [E b (t’) - Ekit'^dt'^^bldtipk). (5.4) 

This is a set of coupled first order differential equations for all the coefficients c& (t). The diagonal terms can be 
removed as follows. Consider first ak(t) = {4>k\dt^k)- Use the normalization ('ipk(t)\'ipk(t)} = 1 and differentiate 
it with respect to time: 


(dt^kli’k) + {i>k\d t il>k) = [otk{t) ]* + a k (t) = 0. (5.5) 

Thus, ctk (t) is purely imaginary, so that we may write dk(t) = i /3k(t) where f3 G 3^. Now, define 

4(t) =c k (t)e if *o Mt ' )dt '. (5.6) 

Differentiating c' b with respect to time in order to get c' b , we obtain: 

4 = - £4(f)e*pU f[E b {t’) - E k {t')\dt'}w b \dtili), (5.7) 

k^b Jto 


We defined ip' k (t) via: 


c k (t)ipk(t) * c k {t)e^o Mt )dt ip k (t)e 1 fto 0k{t )dt =4(t)^(t). (5.8) 

If we assume that the phases of the eigenfunctions ipk are arbitrary at each instant of time, we can do this change on 
all ipk- Assume from now on that this change has been made and we thus omit the ' notation. It is important to note 
that this assumption is invalid for the case of cyclic systems , but we return to this issue later. 
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Looking at Eq. (5.7) again, we examine ('ipb\dt' l Pk) for k ^ b. Differentiate w.r.t. time on the equation H (t)'0/ c (t) = 
and obtain: 


d t H^ k + Hdt'ipk = d t E k ^ k + Ekdt'ipk- (5.9) 

Using the notation ('ip a \'ipb) = f[ipa(r)]*'ipb(r)dr and taking the scalar product with ipb, this gives: 

(il>b\dtH\ fa) + (HH\d t ^k) = E k ('ip b \d t 'ipk)» (5.10) 

Using that H is Hermitian, we obtain for the second term that 

(ipb\H\d t i>k) = Eb(i>b\di>k), (5.11) 

and plugging this back into Eq. (5.10) gives: 

<«»,« =Mt. < 512> 

We introduced the notation ( d t H)bk = {'ipb\dtH\'ipk) and vbk{t ) = [Eb(t) — Ek(t)]/h, b / k. Thus, cObk ^ 0 
always since we assumed that the energy levels were non-degenerate. 


If we now use our obtained results and plug them back into our expression for the coupled equations for the (t) 
coefficients, we obtain (keep in mind that we omit the primes, as explained previously): 


cb{t) = Y 

k^b 


Ck(t) 

tkJbk{t) 


(d t H) bk e iftt o Ubk{t ' )dt '. 


(5.13) 


This system of equations then determine the Cb coefficients, which in turn determine the wavefunction via Eq. 
(5.2). This is a convenient starting point to make approximations, especially when d t H is small (slowly varying 
Hamiltonian in time). If d t H = 0, then the solution is seen to be simply Cb =constant for all b. If d t H is finite, but 
small, we can try to solve Eq. (5.13) by setting all c& on the r.h.s. to be constants. Assume that the system initially 
(t = to) is in a state a • We substitute the values = Ska in the r.h.s. and get 

C b (t) = nr 1 w^{t){d t H) ba e ^o Uba{t )dt , b ± a. (5.14) 

For b = a, we get c a = 0 in this approximation. Now integrate the above equation with the initial condition 
Cb(t < to) = 0 (b 7 ^ a) and obtain: 


ft 

r P* i 

c b (t) = nr 1 

dt'w b a[dt'H(t ')} b0 exp i / u ba {t")dt"\, (b^a). 

Jt 0 

L Jto J 


This is the result for the adiabatic approximation for the probability amplitude q>(£). We should expect this result 
to yield a small |o>(t)| in order to be valid. Thus, Pb a (t ) = |q >(£)| 2 denotes the transition probability from the 
initial state a to state b , and we must have Pbaif) 1 . 

A crude estimate is to assume that ujba and d t H are time independent. We then obtain 

c b (t) ~ (m)- 1 w 6 a 2 (a t i?) 6 a (e i “ i '“ (t -‘ o) - 1 ), 

P ba (t) ~ 4h- 2 ^ a 4 \(d t H) ba \ 2 sm 2 [uj ba (t - t 0 )/2\. (5.15) 

This probability behaves reasonably as time increases since it merely oscillates, and the upper bound is [since 

sin 2 (it) < 1 ]: 


Pba(t) < 


±\(d t H)ba\ 2 

h2 “ta 


The adiabatic approximation is thus valid if 


\(d t H) ba \ 2 « 


Yba 

4 


(5.16) 


(5.17) 
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Example 8 . Charged harmonic oscillator in a time dependent electric field. Let us now try out the adiabatic 
approxmation and consider a charged particle subject to a linear harmonic oscillator potential and a spatially 
uniform, time dependent electric field £(t). The Hamiltonian is then: 

H(t) = + _ q£ ^ x = + _ a( j.y 2 _ bka 2 {t). (5.18) 

Here, we defined a(t) = q£(t)/k. We can physically interpret this H(t) as, at a given time t , describing a 
standard harmonic oscillator with frequency uj — yjk/m, but displaced equilibrium position to x — a(t). The 
term — ^ka 2 (t) is just a constant. The instantaneous energy eigenfunctions are thus obtained as: 

/ OL \ 1/2 o o 

^ n wL '/Tr2 n n\ ) e M~ a ( x ~ a ) / 2 \ H n[a(x - a)}, (5.19) 

where a = \Jmoj/h. The corresponding instantaneous energy eigenvalues take the form 

E n (t) = (n + l/2)fkj — ka 2 (t)/2, n = 0 , 1 , 2 ,... (5.20) 

The angular frequencies uo nn > = [E n t(t) — E n (t)\/h = (n' — n)uo are thus independent of time and equal to the 
unperturbed value. Assume now that 

• 8 (t) is applied at t = t o and that it varies slowly. 

• The harmonic oscillator is initially in its ground state (n m 0). 

We want to compute the probability that the system is in an excited state at t = t\. First, note that d t H = — kax 
with a = (q/k)(d£/dt). To find the transition probabilities, we will need (as derived previously) 

(ip b \dtH\iP o) = (d t H) b o. (5.21) 

In effect, we need to compute matrix elements of the type 

x b o = (^& | a# o). (5.22) 


It can be shown that all these matrix elements vanish when 6/1, while for b = 1 we have x±o = h/(2muj ). 
From the general expression of P ba {t ) derived previously, the only non-vanishing transition probability is 0 1 . 

Inserting our expression for d t H and ccio = uj, we get 


Pio(h) = |ci(ti)| 2 



r 

Jto dt 


The slower the £-field varies, the smaller the transition probability 0 —)> 1. 


(5.23) 


B. The Berry phase 

When we discussed the adiabatic approximation, it was assumed that the phases of the eigenfunctions (t ) are 
arbitrary at each instant of time. This was in fact generally accepted up to 1984 when M. V. Berry showed that: 


In a cyclic system where the Hamiltonian at time tf is the same as at time to, 
there is a relative change in the phase between ipk(to) and 'ipkftf) which 
cannot be removed by a phase transformation and thus has observable consequences. 

To show this, consider the case where H (t) varies so slowly that the system remains in its initial non-degenerate 
state with energy E a (t) and eigenfunction / a (t ). According to our previous treatment of such an adiabatic sce¬ 
nario, the approximate solution of i^^ = H(t)^ is then: 

*(i) = JtE ° {t ' )dt '. (5.24) 
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Since ^ (t) is in the state 2 p a (t) at t = to, we can set c a (to) = 1. Moreover, since ^ a (t) should also be normalized to 
unity, we can write generally at t ^ to that c a (t) = e 17a( ^ with 7 a (t) G 5? and 7 a (^o) = 0- Now, So 

is the usual dynamical phase factor whereas the SE gives us the following equation for 7 a (t): 

17 aWiW = -dt'ipait), (5.25) 

with the solution 

7a(f) = i f (^a{t')\dt'^a{t')}dt f . (5.26) 

Jto 

If the system is cyclic, then H(tf ) = iif(to) since the Hamiltonian returns to its value at t = to at a later time 
t = tf. This also implies that E a (tf ) = E a (to) and ip a (t o) = The iterry phase is the accumulated phase 

change from £ 0 to tf : 


7a = i 
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This turns out to be a physically observable quantity, thus with experimentally verifiable quantities. Importantly, 
the Berry-phase is gauge-invariant and a key message in the 1984 paper by Berry was that any gauge-invariant 
quantity is in principle observable. In fact, let us see what happens if we try to eliminate % by transforming 
'ipa —>• = ^ a e ir?( ^. Under this transformation, the Berry phase becomes where: 

f« = i A*(Oi**(0>* 

Jt 0 

= i [ - ( ~77jr~dt' 

Jto Jto at 

= %-v(tf) + v(to)- (5.27) 

Since a (tf) = 'ipaito), it follows that p{tf) — p{t 0 ) — 2 tt n, n = 0,1, 2,... Thus, e^ a = e 1 ^ cannot be removed 
by a phase transformation. Strictly speaking, the Berry phase j a is gauge invariant up to an integer multiple of 2i r, 
whereas e^ a is absolutely gauge invariant and thus related to physical observables. 

The H(t) may be time dependent through a number of parameters, each of which slowly vary with time. A 
common example: components of an exemal electric or magnetic field which interact with the system. Consider 
the case where H(t ) depends on t via three parameters Ri(t), R 2 (t), i? 3 (t): 

H(t) = H[Ri(t)], i = 1,2,3. (5.28) 

Since H(tf ) = H(to) for a cyclic Hamiltonian, we have Ri(tf) — Ri(to). In vector notation, R = (Ri, R 2l R3), 
we can then write the Berry phase as 


7a = if(MR)\VnMR)) ■ dR (5.29) 

where V# is the gradient in parameter space and the closed integral is taken along the curve C in parameter space. 
We define the Berry connection: 


A(r)=i(MR)\VRMR))- (5.30) 

Since it depends on the closed curve C, the Berry phase is often called a geometrical phase. Such phases arise also 
in a number of non-adiabatic situations as well - not only the strictly adiabatic context discussed here. In fact, a 
generalization of Berry’s phase is the Aharonov-Anandan phase. Suppose a system evolves according to the SE, 
but that the change in H is neither adiabatic or cylic. The system can then still exhibit a geometrical phase: all 
that is needed is a cyclic evolution of the state of the system. Such a cyclic evolution defines a closed path C in 
the Hilbert space of the state. Regardless of whether this evolution is adiabatic or not, it leaves the system with a 
dynamical phase which depends on the Hamiltonian, and a geometrical phase which depends on the path C. 

We also remark that by applying Stoke’s theorem, we have: 

7 a = A(R)-dR = J J B ■ dS (5.31) 

where S is the surface bound by the closed path C and 

B = Vh x A(R) = Berry curvature. (5.32) 

When treating the Aharonov-Bohm effect, we will see a concrete example of the physical consequences of these 
kind of geometrical phases. 

In closing, we comment on whether or not the Berry/geometrical phase is reconcilable with the commonly stated 
fact that the overall phase of a quantum system is unobservable. Yes , because the Berry phase expresses the total 
phase change acccumulated during a cycle (either a cyclic evolution of the state or the Hamiltonian). We assumed 
for simplicity in our derivation that we know the phase at t = t 0 was j a (t = t 0 ) = 0, but generally the Berry 
phase expresses the phase difference : 


la = l(t = tf) - 7 a {t = t 0 ) = i / (lp a (t')\dt>1pa(t'))dt'. 

■Jtn 


Now, phase differences are certainly observable, even if the phase at a given time is not. 


(5.33) 
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Example 9. Relative phase in a superposition of states. A quantum system may be in a superposition of states. 
Then, the relative phase between the state is observable: 

ip = ipA+ipB = \tpA\e iaA + |V’s|e iaB = {\ipA\e l( - aA ~ aB) + \ip B \)e iaB . (5.34) 

It is clear that |^| 2 depends on Aa = a a — &b- More generally, consider two paths R(t) and R! (t) with the 
same end-points: R(to) = R'(to) and R(tf) = R'(tf). If the system now evolves in a superposition of states 
| ^i[R(t)]) and | i/ji[R' (t)]) , then the relative phase of this superposition (analogously to Aa above) contains two 
parts at t = t /: 

• The relative dynamical phase. 

• The Berry phase: the difference between the Berry connection A integrated along R and A integrated along 
R! . In effect, it is the circular integral f c A (r) • dr where C is the closed path comprised of the paths R and 
R'. 
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VI. QUANTUM MECHANICAL SCATTERING THEORY 

Learning goals. After reading this chapter, the student should: 

• Be able to explain what the scattering cross section is and what it provides information about. 

• Understand how scattering can be formulated as a stationary problem in quantum mechanics and set up the 
corresponding asymptotic wavefunction, as well as how to modify this for scattering of identical particles. 

• Know the underlying idea behind the Born approximation and the method of partial waves, and explain when 
these two frameworks can be used. 

• Know what the optical theorem states physically and the principle from which it is derived. 


A. Intro to scattering cross section 

Particles that are incident toward a scattering center - e.g. a different particle - will in general be deflected due to 
the interaction. The distribution of angles of deflection will depend on the details of the setup. Experimentally 
measuring this distribution will provide us with information about the type of interaction that is in play. We 
distinguish between 

• Elastic scattering: the kinetic energy of the scattered particles is preserved. 

• Inelastic scattering: kinetic energy is not conserved, e.g. due to a photon taking off with part of the energy. 

We will consider elastic scattering with the two same particles before and after. For a potential V(r% — r 2 ), we 
know that this can be reduced to an effective one-body problem where only the relative motion of the particles 
matter. We shall initially consider the scattering problem in the corresponding center-of-mass (CM) frame and 
later see how the results are expressed in the lab-frame. 

Consider the following idealized model. 



A uniform flux of particles with density j m is incident on a scattering center S. A detector counts particles scattered 
into solid angle dVt — sin 0d0d<j) enclosing the direction ( 0 , </>). The incident axis is 0 = 0. We have previously 
(chapter 4) defined 


da # particles scattered into d$d per unit time 
dQ dQ • ji n 

Since j m — \j in \ is the number of particles incident per time and area, inspection shows that da/dVt has dimension 
area. The total scattering cross section is obtained as: 

/ j r 2ir pir j 

— -dfl= / / —-sin OdOdcj). (6.2) 

dM Jcj>=0 J 9=o 

The dimension of a is area as well. It corresponds to the total area the incident particles are passing through that 
will cause scattering. Put differently: imagine an area of size a in the incident flux of the particles. The number 
of particles passing through this area will be equally large as the number of particles that ultimately are scattered 
in some direction. For instance, for scattering between hard spheres of radius R, we have a = AtvR 2 . For point 
particles scattering on a sphere of radius R , we would obtain a = ttR 2 . We will primarily stick to central potentials 
V(r) = V(\r |) which thus do not depend on the azimuthal angle 4> due to symmetry. 
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B. Briefly about the classical scattering cross section 

We will look at the classical case prior to the QM treatment. For a central potential (spherically symmetric), the 
trajectory of the particle will lie in a plane and is characterized by two quantities: 

• The velocity v 0 far away from the scattering center. 

• The impact parameter b (see figure below). 



The scattering angle 0 is determined by vq and b, as we show below. Assuming that V (r —oo) = 0, energy 
conservation gives 


E = = ^mr(t) 2 + V[(r(t)] — ^m(r 2 + r 2 a 2 ) + U(r), (6.3) 

and conservation of angular momentum (due to rotational symemtry) gives L = mbv 0 = m\r x v\ = mr 2 a where 
a characterizes the angle of the instantaoues point along the trajectory. Now, express E via L : 


E = W 2 + ^ + v ^' 

and use that r = ^ ^2 • Combine these two equations to obtain: 


da = =b 


L/r 2 


sj2mE - 2mV{r) - L 2 /r 2 


dr. 


(6.4) 


(6.5) 


We integrate this expression from r = r m \ n to r = 00 . Since r m i n by definition is given by dr/da = 0, the ^/TTT 
must be zero there. The d= sign indicates whether da/dr is positive or negative, which depends on the nature of 
the potential. 


For repulsive forces (as shown in our previous figure), the change in a when going from r m i n to r = 00 is then 
(tt — 0)/ 2, while for attractive forces it would be (n + 0)/2. The integration thus provides: 


r° 

(tt ± 6») = / 

J r m 


L/r 2 


\J2mE — 2 mV (r) — L 2 /r 2 


dr. 


Using L — mbvo and E = \mv\, we get: 


(*±«o= f 

J r n 


b/r 2 


* v 7 ! - V^E- 1 - b 2 r~ 2 


dr. 


(6.6) 


(6.7) 


This relation defines the connection between b and 0: knowing b of the incident particle, we can compute 0. A 
certain interval db corresponds to an interval dO according to: 


db = 


I db(6) 
I dO 


d0 = 


| db{9) 

i dVt 

dO 

\ 27r sin 0 


( 6 . 8 ) 
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Absolute values have been introduced since db/dO is often negative: a large impact parameter causes a smaller 
scattering angle. To compute the cross section, we note that there is an area 2irb db contained between the impact 
parameters b and b + db. The number of particles passing through this area per unit time is jm^nb db. According 
to our above treatment, the number of particles scattered into dfl per unit time is then: 


jm^Trb db — j\nb 


db 

dO 


d£2 
sin 6 


Using our definition of the differential scattering cross section, we obtain the final result: 


(6.9) 


da_ _ b{0) I db(0) | 
dQ sin 0 1 dO I 


To compute da/dQ, one thus has to identify b(0). 


( 6 . 10 ) 


Example 10. Coulomb-potential scattering. Two charges Ze and Z'e interact via the well-known V (r) = 
z ^ e e r . To find b(0), we then have to compute: 


1 r°° b/r 2 

-(tt±0) = / , ' =dr. (6.11) 

2 J rm „ i/I - ZZ'e 2 /{47re 0 Er) - 6 2 r“ 2 

Upper sign: attraction (ZZ 1 < 0). Lower sign: repulsion (ZZ 1 > 0). Introducing x = b/r and a = 
ZZ'e 2 /(8Tre 0 Eb), we obtain 


i(„±0)=/' /ig ~ 1 ' ; ** 

2 J 0 \/l +3 2 - (x + 5) 2 


( 6 . 12 ) 


As commented on previously, x miiX (or equivalently r m i n ) is determined by yCTT as 0. This integral can be evaluated 
and yields (after rearranging the equation): 


b = 


ZZ'e 2 
^ 8neoE 


cot(0/2). 


(6.13) 


Thus, after differentiating b with respect to 0, we obtain the differential scattering cross section (known as the 
Rutherford cross section): 

da = / 1 

dQ VI67 teqE/ sin 4 (6>/2)' 

For small angles 0, da/dQ oc I/O 4 , causing the integral to diverge 

rTT dp 

a = I 2tt s'm 6dO ^ 00 . (6.15) 

Jo d^ 


Small 0 corresponds to large impact parameter b, and so this result reflects the fact that the Coulomb-potential has 
infinite range, causing scattering of all incident particles. 


More generally, a potential V(r) 7^ 0 for r < a and V(r) = 0 for r > a will have a = 7 ra 2 classically. In QM, 
this is different: a can be infinite or finite when the range a 00 , depending on how fast the potential V (r) goes 
to zero when r —)> cxd. The Coulomb-interaction is arguably the most important interaction in physics, and we will 
treat it quantum mechanically in what follows. 


C. Scattering as a stationary problem 

We have seen an example of such a scenario (scattering as a stationary problem) in elementary QM courses: 
scattering on a potential barrier in ID. We thus seek the solution of the time independent SE: 

[^ v2 + nr)]#r)=£#r), (6.16) 
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and use appropriate boundary conditions for the solution 'ip in order to describe an incident flux of particles and an 
outgoing stream of scattered particles. Let E = h 2 k 2 /2m and U (r) = 2 mV ( r)/h , and obtain 

(V 2 + k 2 )ip = Uip. (6.17) 

Near the scattering center S, the behavior of ip may be complicated. However, for r —> oo we can neglect U (r) 
and the resulting free particle solution should then describe an incident plane-wave and radially outgoing particles: 

Ip — V’in + V^scatt for T > OO. (6.18) 

Outgoing 
spherical wave 


Inc. plane-wave 


S 


r = 0 



The incident wave: ip{ n = Ce lk r where C is a constant and hk = pi = \]2mE. 


The scattered wave: Must be a spherical wave with the same energy (wavenumber k) as the incident one, hence 

VWt = Cf(e,<P)^L. 
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The factor 1/r ensures that the outgoing current density j sca tt is proportional to 1/r 2 . Now, since the surface 
element corresponding to a solid angle element dfl increases with distance as r 2 dfl, this means that for large 
distances it is the same number of particles passing through any cross-section of the given, solid angle element, as 
expected and shown in the figure. 



The factor /(#,</>) determines the angular distribution and is known as the scattering amplitude, which in turn is 
determined by V (r). We shall return to this issue. Let us now determine the differential scattering cross section 
expressed in terms of /(0, </>). We know that a quantum mechanical probability current density is given as: 

j = Re{i/)*T v-0}. (6.19) 


This means that 


jin = R e{^in AvV>in} = ^|C| 2 , 

jscatt-|C|W^)| 2 -^- 


Since the definition of da is: 


_ JscattT dQ 

jin 


( 6 . 20 ) 


( 6 . 21 ) 


we obtain by insertion: 


da 

dn 


l/(M)l 2 - 


Since C turned out to be insignificant, we set C — 1 in what follows, for simplicity. Note how we have obtained 
an expression for da/dfl using only the asymptotic (large r ) form of the wavefunction. Summarizing the idea so 
far: 


• We seek a solution of the time independent SE 

(V 2 + k 2 )il){r) = U(r)ip(r), (6.22) 

• For large r, the solution should have the form 

+ — • (6.23) 

r 

• The diff. scattering cross section is then: 

^ = l/(M)| 2 - (6.24) 

The remaining task is to determine /. First, a few comments: 

1. We have here assumed elastic scattering, thus neglecting the possibility of the particles making energy tran¬ 
sitions during the collision. 

2. We have assumed free particle behavior at r —» oo. For potentials with infinite range, it is essential how 
fast V 0 when r oo. It turns out that if rV (r) —» 0 for r —» oo, we obtain the free particle 
asymptotic behavior. The Coulomb-potential does not satisfy this and we shall later see how this influences 
the asymptotic form. 
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3. In a real experiment, one does not send in an infinite plane-wave toward S, but rather a beam that is colli¬ 
mated (focused) in space. The localized nature (let k be the width of the beam in the i-direction) causes an 
uncertainty in the momentum A pi ~ h/U. However, l t will usually be much larger than atomic distances, 
and the lack of precision in momentum should thus be negligible compared to the change in momentum 
(direction) caused by the potential. We may thus disregard finite-size effect of the beam and model it with a 
plane-wave. 

4. In practice, one scatters particles on a macroscopic collection of particles rather than a single scattering 
center S , e.g. a gas of particles. To use our approach, the thickness of the target has to be large enough to 
cause sufficient scattering intensity, but small enough to keep multiple scattering at a minimum. 


D. Integral equation for the scattering amplitude 


Our strategy here will be to transform the SE into an integral equation in order to incorporate the correct asymptotic 
behavior ip(r) ~ e lk r + f(0 , <j))e lkr /r. This transformation is done with the aid of the Green function G(r — r'), 
defined by 


(V 2 + k 2 )G{r — r') = S(r — r'). 

If G is known, then the SE is equivalent to: 

^(r) =-0 o (r) + J G(r — r')U(r f )'ip(r')dr f , 


(6.25) 


(6.26) 


where t/’o is a general solution of the homogeneous equation (V 2 + /c 2 )^o = 0. To establish this equivalence, 
operate with V 2 + k 2 on Eq. (6.26): 


(V 2 + k 2 )'ip = 0 + J S(r — r / )(7(r / )'0(r / )<ir / = U(r)i/j(r) 


which is precisely the SE. The second order differential equation for G has two independent solutions: 

e ±i/c|r— r'\ 


G(r — r') = — 


47t |r — r'\ * 


(6.27) 


(6.28) 


To see this, it is sufficient to demonstrate that G(r) — — satisfies (V 2 + k 2 )G{r ) = S(r). Since V 2 = 
+ r Jr + angular derivatives, we obtain for r > 0: 

d e ±ikr 

v r 


dr r 


/±ifc _ j_\ + ikr 


(6.29) 


and 


dr r 


-( 




)e ±ifcr . 


(6.30) 


Combined, this yields (V 2 + k 2 ) - ' - = 0. This is consistent since S(r ) = 0 for r / 0. To justify the presence of 
the (5-function, we integrate (V 2 + k 2 )G over a spherical volume with radius R by using the formula: 


/ X7 2 Gdr = [ S7G-df = 4itR 2 (d r G) r=R 
Jv J f(V) 


(6.31) 


>f(V) 

where /(V) is the surface of the volume V. We then get: 


/ (V 2 + k 2 )G = 4nR 2 ( 

Jr<R ' 


± ke ±lkr e ±ikr \ 
-4? tR + 4t rf? 2 / 


+ k 


l 


R gdzi/cr 

—47rr 


4:irr 2 dr 


±ikr + e ±ikr + k 2 + g±i kr _ = L (6.32) 

L \k k z J o 


— =pi kRe 

Since the integral over (V 2 + k 2 )G is 1 for any finite radius R, we must have (V 2 + k 2 )G = S(r), which 
completes the proof. Now that we know exactly what G(r — r ') is, we can insert it into Eq. (6.26) in order 
to find ^(r). The choice of (r) is dictated by the boundary conditions and the fact that it has to satisfy 
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(V 2 + k 2 )^o = 0. We therefore set (r) = e lkr since U = 0 should give precisely this wavefunction as there is 
no scattering in this case. 

Secondly, we choose the + solution for G so that we recover the correct form e lkr jr for large r. With these two 
choices, the solution for then becomes: 


ip(r) = e lk r - j- / t- —U(r')ip(r')dr'. (6.33) 

47 t J \r — r \ 

Note that \r — r'\ ~ r for large r. If we now look at the behavior of the above equation in the large-r limit, we 
will be able to identify /(#, (j)) by comparing directly with the form ^(r) e lkr + e l/cr /(6 > , (j>)/r. 

First, we do the large-r expansion more accurately. We have: 


- / • v r frM^ 

k\r — r'\ = kJr 2 — 2 r ■ r 1 + (r ') 2 = kr\ 1 - ~ - h V 0 = kr — k' • r' + 0(l/r), (6.34) 

y 

where /c 7 = kr/r points in the direction that the particle has after scattering. The momentum of the final state is 
thus Pf = hk'. Note that \pj\ = H\k\ = h\k'\: conservation of momentum. Using our expansion, the integral 
equation then takes the form 


i p i kr r 

Mr) ~ e ik r - -/ e~ ik r U(r')Mr')dr'. 

47 t r J 

Now, we can finally read out the scattering amplitude: 


/(M) = -^- J e lk ' r 'U(r , )ip(r , )dr l . 


(6.35) 
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It may appear as if we still have not accomplished much since this expression still depends on the unknown r ). 
However, it turns out that our formulation is still useful, because we have now set up the problem in a manner 
which makes it suitable for an iterative treatment. 


E. Born-approximation 

If the scattering potential is weak (and we shall later specify what this means quantitatively), we can solve our 
integral equation for ^ by iteration. The n-th approximation is obtained by using the (n — l)-th approximation on 
the r.h.s. of 


1 r e ik\r-r'\ 

^(r) = e lkr — —- / - —U(r') / ip(r')dr'. (6.36) 

47 t J \r — r'\ 

The most basic approximation, (r), is simply to set it equal to the incident plane wave. Thus, we obtain 

^°\r) = e ik - r , 

* W (r) = e ikr ~IJ 

t/>( 2 )(r) = e ik r — 2- J | 

^ (3) = ... (6.37) 

and so forth. In this manner, we can obtain better and better approximations for /(#,</>) by inserting approxima¬ 
tions for r ). This expansion is known as the Born-approximation and one often settles for the lowest order 
correction. We now examine this in more detail. 

First order Born-approximation. 

Using = e lfc-r , we obtain 


/B(M) * I ei{k ~ k ' yru ( r ) dr > ( 6 - 38 > 

where the B superscript indicates that this result has been obtained in the first-order Born-approximation. Intro¬ 
ducing q = k' — k and reinstating U = 2mV/h 2 , we get: 



In other words, the scattering amplitude f B is essentially the Fourier-transform of the potential. The physical 
meaning of q is that it is the momentum-transfer during the collision: q = 2k sin(0/2) according the figure. 



k 


Note that so far, we have not made any assumption about the potential being spherically symmetric. If it is, 
however, we may simplify the expression for f B as follows. Let V(r) = V (r) and let r point along the polar axis. 
We then obtain 


ft 

Jv= 0 J C=0 




'd( sin vda — 2 tt 


i qr cos is u= 


i qr 


v =7r 47rsin(gr) 


is =0 


qr 


(6.39) 


The result for f B is then: 


2m f°° 

f B (0) = -—z- / V(r) sm(qr)rdr. (6.40) 

n 2 Q Jo 

Note that ( and v are just integration variables without any special significance. In the forward scattering case 
( 0 = 0), f B becomes independent on q and thus the energy of the particle. This might appear strange at first 
glance and in fact it is physically incorect: it is an artifact of the perturbation expansion of the Born treatment. 
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Going to 2nd order in the perturbation V (r) fixes the problem. The point is that one must account for interfer¬ 
ence between the incoming wave and the outgoing wave for 0 = 0 in order to correctly describe forward scattering. 

In contrast, this interference is not as important for 0 ^ 0. For such directions, the oscillating term sin (qr) renders 
the integral small when qa 1 where a is a measure for the spatial range of the potential. This means that for 
high energies (large k), the differential scattering cross section is very small except when qa = 2kasm(0/2) ~ 1, 
which for large k means 9 ~ 1 /(ka). We then conclude that high-energy particles do not change their direction 
much, keeping their trajectory close to 0 ~ 0. 

For the total scattering cross section, we have that a B = f y/^dVl — f \f B {0, 0)| 2 dfh For a spherically symmet¬ 
ric potential, we obtain 

2tt 

d£l = 2i r sin OdO — 47r sin(0/2) cos(0/2)d0 = —q dq (6-41) 

Kj 

where we utilized that 2k sin(0/2) = q so that dq — k cos(0/2)d0. a b can then be obtained by integrating over q: 

n f‘2k 

" b = tj 0 v B w 2qdq - (642) 

Since the integral grows as E = h 2 k 2 /2m increases, a B cannot decrease faster than 1/E. More precisely, if the 
integral converges at high energies, one obtains a B oc 1/E. We are treating this problem non-relativistically, so 
"high energies" still means that E <C me 2 . 


When is the Born-approximation valid? 

The iteration procedure that we have utilized is based on the assumption that the incident plane wave is not severly 
altered. In effect, we require that | if>{r) — e lk ' r \ <C 1. Using our expression for in the Born-approximation 
provides: 


1 

47r 


/ 


e ik\r-r'\ 

- -U(r’)e' k r dr 1 

\r — r'\ 


< 1. 


(6.43) 


The modification of the incident wave is expected to be largest at the scattering center r — 0, so the strictest 
requirement is: 


1 

47T 



< 1. 


(6.44) 


If we want an even stricter requirement, we take the absolute value of all factors in the integrand: 



r dr 1 


(6.45) 


where we used that dr — 47 ir 2 dr for U(r) = U(r). When this inequality is satisfied, the Born-approximation is 
expected to be good for all energies. 


Example 11. Bound-state in a constant potential. For a constant potential Vo with range R , the criterion of 
validity takes the form 


m\V 0 \R 2 

h 2 


< l. 


(6.46) 


At the same time, we know that a negative potential — |Vo \ can bind states if m\Vo \R 2 > 7r 2 H 2 /8 from introductory 
QM. This is in agreement with the criterion: we expect the Born-approximation to be valid when | Vo | is sufficiently 
weak to be unable to bind a particle with mass equal to the incident particles. 


For a finite-range potential with finite magnitude, one can always use the Born-approximation at sufficiently high 
energies. To see this, let us go back to the ^-dependent criterion 


1 

47T 



< 1. 


(6.47) 
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Assume now a spherically symmetric potential U(r) = U(r), in which case we can perform the angular integra¬ 
tion: 


This then gives: 



sin v do 


2 sin (kr') 
kr' 


1 

k 



sin (kr')dr' 


< 1. 


Since 


p.i kr' 


sin(/cr / )| < 1, the criterion 


(6.48) 


(6.49) 


f‘OG 

A;» / \U(r)\dr (6.50) 

J o 


is sufficient to guarantee that the original /c-dependent criterion is fulfilled. We conclude that for large enough k, 
we can always fulfill the above equation. Note that using the Born-approximation, we have actually obtained the 
same result for da/dQ for a weak potential V as we did using time dependent perturbation theory in chapter 4: 


da 

dn 


m 
2irh 



2 


e i(Pi-P/)-r/ft dr 


(6.51) 


which is reasonable since the Bom-approximation is good when the potential is weak or the particle energy is high. 


Example 12. Scattering on the Yukawa-potential. Let us apply our scattering framework on a screened 
Coulomb-potential which is known as a Yukawa potential: 

7 7'p 2 

V(r) = (6.52) 

47re 0 r 

Here, a determines the screening radius. Using our derived result for the Born scattering amplitude gives: 


f B (0) = - 


2m ZZ'e 2 
h 2 q 47reo 



sin(gr)dr = — 


2 m 2 

h 


ZZ'e 2 1 

4tt€q a 2 + q 2 


(6.53) 


Using the relations introduced previously: q = 2k sin(0/2), k = p/h = V2mE/h, we can write down the 
differential scattering cross section: 


da B _ , , B | 2 _ /_ ZZ'e 2 /4:7T6 0 _ 

dtt ~ IJ 1 _ \a 2 h 2 {2m)- 1 +4Esin 2 (6/2) 


(6.54) 


It is interesting to note that since d^a B is finite for all angles when a/0, the total a B will also be finite. This 
is in contrast to the classical value a for this potential which becomes infinite. In the limit a 0, we obtain the 
usual unscreened Coulomb-potential. Remarkably, the QM Born-result for d^a = da/dtt is not only identical to 
the classical result, but it is even identical to the exact QM Coulomb cross section! The derivation is not shown 
here, but one finds in the exact treatment that 


/exact (0) = 


2k sin 2 (6/2) 


-2ilnsin(0/2)+i5 


(6.55) 


where we defined 


m ZZ'e 2 
kh 2 47T60 


(6.56) 


and where 6 is a constant (independent on 0). Since \e l6 \ = 1, this phase-factor has no consequence for the cross 
section. However, we will later show that when scattering identical particles on each other, it will have an effect. 
The result that 


^classical _ d(J B _ ^exact 

dfl dQ dQ 


(6.57) 


for the Coulomb-potential must be regarded as a coincidence, since the criteria we listed for the Born- 
approximation are not expected to be valid for the Coulomb-potential. 
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Elastic scattering on atoms. 

If the electrons are fast (energetic), we can treat scattering on neutral atoms via the potential 


V(r) = + 


47reo r 47re ( 


— f 

47Te 0 J 


n(r') 


r — r' 


dr'. 


(6.58) 


This consists of the Coulomb repulsion from the core +Ze in addition to the potential from the electron 
distribution —en(r). Charge neutrality dictates that f n(r)dr = Z. The reason for why the electrons must be 
fast in order for us to use the above potential is that the true antisymmetrized tjj gives a correction to the result 
otherwise. ^ must be antisymmetrized for scattering of electrons on an electron, since these are identical parti¬ 
cles quantum mechanically. Recall that r is the relative coordinate between the potential and the scattered particles. 


We use the Born-approximation, meaning that the incident electron E satisfies 13.6Z 2 eV < E < 500 000 eV: 
it is much larger than the typical potential energy scale, while still non-relativistic ( m e c 2 ~ 0.5 MeV). Now, we 
seek the scattering amplitude f B = — f V(r)e~ iq ' r dr. Introducing s = r — r 1 and using our result for the 
Yukawa-potential without screening (a = 0), we get: 


/ 


g-iq-r 

| r - r' | 


dr 



(6.59) 
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It follows that 


e 2 2m Z — F(q) 
47reo h 2 q 2 


(6.60) 


where F(q) = f n(r)e~ iqr dr is known as the atom form-factor. It is the Fourier-transformation of the electron 
distribution. If n(r) is spherically symmetry, it follows that F(q) = F(q). With q — 2/csin(#/2) and E = 
h 2 k 2 /2m as usual, the differential scattering cross section becomes: 

rj/jB / p2 x 2 

■w = w = (w^e) si ” v/w ~ ( 6 . 61 ) 


By measuring da B /dQ, we can thus obtain information about F(q) and, in turn, the electronic distribution of the 
atom. The idea is thus to scatter a simple particle on a complex structure (potential) to gain info about the complex 
structure. 


Two particular limiting cases are of interest: 


1. When the scattering angle 0 is not small, q = 2/csin(#/2) is sizable when k is large (energetic electrons). 
The result is that F becomes small since the integrand oscillates around zero. Quantitatively, this requires 
that 1/q <C atomic dimensions, i.e. ~ 1 A. If we thus can neglect F compared to Z, d^a is essentially the 
Rutherford cross section. This result makes sense physically: a particle with high E can only scatter a large 
angle 0 if it comes close to the core. 

2. In the opposite regime, for very small angles, we can expand F in powers of q: 

F(q) — j n(r)[l — i q r — -(q • r) 2 + .. .]dr — Z — J r 2 n{r)dr. (6.62) 

The second term oc i q r vanishes due to symmetery. For the third term, we used that the integral with 
q 2 x 2 + q 2 y 2 + q 2 z 2 is 1/3 of the integral with ( q 2 + q 2 + g 2 )r 2 = q 2 r 2 . Define now the average atomic 
radius R\ 

9 f r 2 n(r)dr 1 f 9 / x 7 

R 2 — ^— 7 .———— = — / r 2 n(r)dr. (6.63) 

/ n{r)dr Z J K J J 

This yields Z — F{q) ~ ZR?q 2 /6, which in turn is <C Z for small angles (small q). This means that 


da B ~ / ZR 2 \ 2 
dfl V 3ao / 


(6.64) 


with ao = Aire^h 2 /me 2 . The scattering is then independent on 0 and small when 0 is small. This may 
be physically interpreted as the electron cloud effectively screening the core Ze at small scattering angles 
(classically, this corresponds to a large impact parameter). 


F. The method of partial waves 

So far, we have seen that the Born-approximation is good when E of the incident particle is large. Now, we will 
consider a method which is good in the opposite case, namely the partial wave method which is useful for low 
energies (for instance scattering of sound) and was developed in 1927 by Holtsmark and Faxen. 

Scattering amplitude 

We know that the energy eigenfunctions for a spherically symmetric potential V (r) can be written generally as 

oo l 

ip{r,6,(f)) = EE ci m Ri(,r)Y lm (0, (j>) (6.65) 

/=0 m=—l 

where k 2 — 2mE/h 2 and U(r) = 2mh~ 2 V(r) where R satisfies: 

F(r Rl ) + [lk 2 - U(r) - EtT] {rRl) = o. (6.66) 
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We are interested in E > 0 (continuous part of the spectrum) and cylinder symmetric (no ^-dependence) solutions, 
as is often the case in scattering problems. Choose z as the incident axis. Eq. (6.65) is independent on </> when 
m — 0. The function Y/ m ($, (j)) then reduces to Legendre-polynomials Pi(cos0). These form a complete set for 
cylinder symmetric functions, so that: 


9) ss Y. qRi ( r)Pi (cos 6) (6.67) 

1=0 

where q are constants. We may then also express f(0) — Yl'iZo fiPi( cosd ) where fi are constants. This is the 

announced expansion in "partial waves", each characterized by a quantum number /. Keep in mind that Pi (cos 0) 

* 2 

is an eigenfunction for L with eigenvalues h 2 l (l +1). We recall that the scattering amplitude f(0) is defined from 
the asymptotic behavior: ip(r) — e lkr ~ f(0)~y- valid for r oo. To expand / in partial waves, we must first 
expand ^(r) and e lkr in Legendre polynomials. Start with the incident wave: 


ji kr = e ikrcos0 = J2 d i(kr)Pi(cosO). 
1=0 


Introduce x = cos 0 and use the orthogonality of Pi (cos 9): 


J i Pi(x)P n (x)dx = 


$lr 


( 6 . 68 ) 


(6.69) 


Applied on Eq. (6.68), we obtain by multiplying with P n (x) on both sides and integrating: 


/ l oo 

P n (x)e lkrx dx = / di(kr)Pi(x)P n (x ) dx. 

- 1 i=o J- 1 


Therefore, we see that 


di(kr) = 


+ 1 f 1 Akrx 


e lkrx Pi(x)dx . 


(6.70) 


(6.71) 


We want to see how this behaves for large r. To do so, consider general integrals of the form I = e lsx g(x)dx 
for large s. Consecutive partial integrations, where the exponential function is integrated, provides: 

r piscc i /*1 pisa; r pisx piscc -.1 /*1 pisx 

I =[g(x)—]_i-J_ i g'( X )— dx = [ g{x )—- g ' {x )—]_ i+ jj' {x) — dx (6.72) 


and so forth. Since s is presumed to be large, we obtain smaller and smaller terms. The dominating term for large 
s is then: 

I=[ e isx g(x)dx = g ( 1)|- g{- 1)^7-b£>(s -2 ). (6.73) 

7-1 15 15 

In our case, s — kr and g(x) — Pi(x). Moreover, P/( 1) = 1 and Pi(— 1) = (~1) Z by definition. Hence, the 
asymptotic behavior of di is: 

07 i 1 

P[e ifcr -(-l)'e- ifc H. (6.74) 


Using (—l) z = e 17r/ and 2isiny = e iy — e iy , we rewrite this to di(kr ) ~ (21 + l)i z sm ^ fe ^ r ^^ for large r. We 
have now managed to identify how e lk r is expanded for large r. It remains to find the asymptotic behavior of 
^(r). We have / 0 (r , 0 ) — o c iRi( r )Pi( cos d ) where the equation determining Ri in the limit r -A oo reads: 

d 2 

— (rRi) + k 2 (rRi) = 0. (6.75) 

Hence, we disregarded /(/ + l)/r 2 and U ( r ) [this is fine when U(r) drops faster than 1/r for large r]. The solution 
of Eq. (6.75) is sine and cosine functions. With two arbitrary constants q and Si, we can write the general solution: 

rRi ~ (21 + l)i l ci sin (kr — irl/2 + Si) (6.76) 
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for large r. We have written the solution in this form to look as similar as possible to the expansion of e lkr obtained 
previously. The quantity Si is the phase picked up by the wavefunction as a consequence of the scattering potential 
and is referred to as the l- th scattering phase. In the absence of any potential U (r), one finds Si = 0. To determine 
Si in the general case, one has to solve the radial equation for all r and then inspect Ri for large r. Inserting our 
expansions, we have thus found 

il>(r) — e lkr ~ Pi(cos6)(2l + l)i l [ci sin (kr — lir/2 + Si) — sin (kr — hr/2)\/kr. (6.77) 

In order to finally identify f(0 ), we should now focus on the conditon that the above expression should only 
contain spherical waves of the form e lkr /r according to the asymptotic expression for the wavefunction. This is 
accomplished by noting that: 


[...] = ^7 (cie iSl - 1 y(kr-i*/2) _ i_( C; e _i<5i - 1 ) e - i (^-W 2 ). 

It is clear that we must choose q = e lSl to remove the e~ lkr term, which leaves us with 

\kr 00 


V’(r) - e ik r * - l)n(cos6»). 


(6.78) 


(6.79) 


1=0 


Since e 2l<5z — 1 = 2ie lSl sin Si, we can now identify /(#): 


m = j: + 1 ) eI<5 ‘ sin SiPi(cos9). 

1=0 

where, as usual d^cr = \f(9) | 2 . We have not solved the problem entirely yet, but we have established a connection 
between the solution of the radial equation (i.e. determining Si) and f(0). We will look at a concrete application 
later where Si is determined and hence solving the problem. 
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Total scattering cross section. 

Our expression for f(Q) determines a: 


= f l/WI 

Jo 


2 2 n sin OdO = ^ 
K 


^^(2/ + 1)(2/' + l)e l6t 161 sin Si sin S[ f Pi{x)Pi>(x)dx. (6.80) 
nr J -1 


Using the aforementioned orthogonality of Pi(x), we obtain: 


= Ss( 2/ + i ) sin2<5 (- 


k 2 


(6.81) 


Z=0 


We may note an interesting relation between a and the forward-scattering amplitude (0 = 0). Setting 6 = 0 in 
f(6) = \ (2Z + l)e l6t sin(cos 0) and using that P/(l) = 1, we obtain/(0) = ^ 5^ 0 (2Z + l)e 1( ^ sin5/. 

It follows that we may write generally: 


cr = — Im{/(0)}. 


This relation is known as the optical theorem. We will later give a general proof of the theorem. The fact that /(0) 
appears is related to that in order to cause scattering, the incident beam must be weakened. This is achieved via 
destructive interference between the incident beam and the outgoing forward-scattered beam, described precisely 
via /(0). 

Number of significant phases. 

If particles with momentum hk approach a potential with range R , only particles with angular momentum hkR or 
less should be scattered from a classical perspective. Since the angular momentum size is hy/l(l + 1), we obtain 
that 


^1(1 + 1) < kR. (6.82) 

For low energies kR <C 1 (particle wavelength 2ix/k R ), we see that only l = 0 contributes. In this case, 

/ k~ 1 e l6 ° sin So as only the l = 0 partial wave contributes and we obtain isotropic scattering since 

k~ 2 sin 2 (5o —a = ^ sin 2 5 0 . (6.83) 

ail k z 

This shows why the partial wave method is so useful for low energies. The scattering amplitude has dimension 
length, and the low energy limit for / (which for finite-range potentials is independent on the angle) is often called 
the scattering length a: 


lim f = —a. 

k^o 

This results in a = Att a 2 . To be more specific about what "low energies 

h 2 k 2 h? m /ao\ 2 

2m ^ 2 mR 2 m e \ R/ 

by using the energy expression for the n = 1 Coulomb potential. 


' means, note that kR <C 1 gives: 
13.6 eV 


(6.84) 


(6.85) 


Ramsauer-Townsend effect: sign of the phase-change. 

The phase-change determines the scattering cross section at low energies. sign(£o) is related to the sign of the 
potential V(r). To see this, recall that So was defined by writing the solution of 

~^ 2 u o + “ U(r)]u 0 = 0, (6.86) 

where uo = rRo(r), on the form uo oc sin (hr + So) for large r. If U < 0, the effective wavenumber \/k 2 — U(r ) 
is larger than for U = 0. In turn, this means that the particle wavelength A inside the potential becomes shorter, 
so that uo will have a stronger curvature. The wavefunction is then "pulled" closer to r = 0, corresponding to a 
positive phase-change as shown in the figure. 
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rR 0 (r ) 



Range of potential V (r) 

Conversely, a positive potential gives a negative Sq. Outside of the range of U, the wavelength is of course the 
same in all cases. If the attractive potential (U < 0) is sufficiently strong to "pull in" the partial wave l = 0 to the 
extent that £o = 7r, then sin^o = 7r and a 0: the scattering cross section vanishes. For a given potential, this 
effect (Raumsauer-Townsend) requires a specific energy. It has been experimentally observed, for instance as an 
extremely low minimum in the cross section of electrons scattering on noble gas atoms (Xe, Kr, Ar) at energies 
E~ 0.7 eV. 


Example 13. Low-energy scattering on a hard-sphere potential. Consider a hard-sphere potential with range 
R , such that the wavefunction uo(r) = 0 for r < R while it is a free particle uo{r) oc sin (kr — kR) for r > R. 
As required by continuity, we see that u 0 {r — R) — 0. The phase-shift £o = —kR is thus negative as expected for 
positive (repulsive) potentials. The total cross section contribution from l = 0 valid for any energy is then 

47T 

(Jq = — sin 2 (kR). (6.87) 

For low energies, the partial wave l — 0 gives the dominant contribution. In this case, k 1/R so that 
sin (kR) ~ kR, which gives a ~ 47 t R 2 . It is interesting to note that this QM expression is four times as large 
as the classical limit for this potential, cr c i as sicai = ttR 2 . What is the physical reason for this? We can understand 
this result by realizing that in QM, particles have a wave character. Therefore, the particles will probe the entire 
surface area of the hard spheres rather than just their cross-section, similarly to how water waves would interact 
with an object. For the opposite limit of high energies, one obtains a = 2irR 2 , which still is different from classical- 


Resonant scattering. 

To illustrate this phenomenon, consider low-energy scattering on a well-potential: 


V(r) 


—Vo for r < R 
0 for r > R 


( 6 . 88 ) 


We know by now that at low energies kR <C 1, only the partial wave l = 0 contributes significantly to the cross 
section a, according to a « |f sin 2 £ 0 . The task is to determine £q- To do so, we must relate the solution for 
r < R, uo(r) = Asin(ftr) with n — ^2m{E + Vo)/h 2 , with the solution for r > R, uo(r) = B sm(kr + £ q ) 
with k = yj2mE/h 2 . This is accomplished by continuity of uq and u' Q (usual boundary conditions) at r = R, 
which yields 

k 

tan(£o + kR) — — tan (kR). (6.89) 

n 

Now, neglecting the small term kR compared to we obtain 


sin 2 


tan 2 

1 + tan 2 £ 0 


k 2 tan 2 (kR) 
k 2 + k 2 tan 2 (kR) 


(6.90) 
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The total cross section finally takes the form 


Air 

= k 2 + £ 2 ’ 


where £ = 


hi 

tan (k,R) 


(6.91) 


For a given k, the cross section is maximal when £ = 0, i.e. when kR = (n + l/2)7r and n is an integer. Inserting 
the definition of k, we get: 


E = -Vo + 


ft 2 ^ 2 

2mR 2 


K ) 2 


(6.92) 


Physically, this means that when the incident particle has just the right resonant energy satisfying the above equa¬ 
tion, it will have a tendency to be bound by the potential and remain at r < R, thus causing a major disturbance of 
the wavefunction -» large a. 


G. The optical theorem 

We previously proved the relation a = ^rIm{/(0)} for a spherically symmetric potential V(r). Now, we will 
demonstrate that this theorem is in fact a direct consequence of particle conservation: for a stationary problem, 
the net flux of particles into any volume has to equal the net flux out. Choosing the volume as a sphere of radius 
r centered around the scattering center, this means that f j r r 2 dVt = 0 where j r is the radial probability current 
density. We know that this is given by 


jr = Re{^*— dr^}. 


i m 


(6.93) 


To compute this, we choose r to be so large that we can use the asymptotic expression ip = e lk r + f(0, <fi)e lkr /r. 
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We begin by computing 

r 2 i>*d r Xjj = i kr 2 cos 9 + ifcre- ifer(1 - cos6)) cos Of* + (i kr - i) e ifcr (i-c°s e) f + ^ _ l/ r )|/| 2 . (6.94) 

By now multiplying with h/im , taking the real part of the expression and integrating over all angles, we should 
obtain zero according to f j r r 2 dVt = 0. Now, the last term (|/| 2 ) in Eq. (6.94) becomes purely imaginary and 
gives no contribution. The first term oc cos 6 gives zero upon integration since cos 6 sin 0 = 0. Finally, we also 
get rid off the term oc k\f\ 2 by using that f \f\ 2 dfl = cr. Dividing the remaining terms in the equation by hk/m 
gives: 

Re| [ [ [ re -^r(i-cosG) CQs0f * + ( r + [/k)e ik r^- cos ^^smOdo} + a = 0. (6.95) 

^ 70=0 J 6=0 > 


Introducing cos 6 = x and using that 1 +■ i/kr ~ 1 for large r, this equation becomes 


- rRe U7> 


-\kr-\-\krx 


f*+e' 


ikr—ikrx 


f]dx dcj) j. 


(6.96) 


We now make use of a previously derived result, namely Eq. (6.73). This expansion can be used with s = kr in 
the first term of the l.h.s. in Eq. (6.96) and s = —kr for the second term. The result is 

1 f 2n 47r 

CT = - y 2Im/(0, (j))d(j) = —Im/(0). (6.97) 

We here used that in the forward-scattering direction 6 = 0, there can be no 0-dependence —>> 0-integration merely 
gives a factor 2i r. We have thus proven the optical theorem. We note that: 

1. In contrast to our previous derivation using the method of partial waves, we now did not maky any assump¬ 
tion about the potential being spherically symmetry. 

2. The expression we found in the Born-approximation for a spherically symmetric potential, f B (0) = 

— J 0 °° V{r) sin (qr)rdr is real. This means that it cannot be used in the optical theorem, since it gives 

cr = 0. 

3. We have assumed elastic scattering. If inelastic processes occur, e.g. exciting internal degrees of freedom 
in the particle or fragmentation of particles, the net current through a volume is no longer zero. Instead, it 
must be negative since inelastic scattering processes remove particles from their original state. In this more 
general case, the optical theorem reads 


^el 3“ Cfinel — ^ Im{/ e ] (0)}. 


(6.98) 


H. Lab- and CM-system 


In our scattering theory so far, we have considered a particle scattering on a stationary potential. In fact, this 
corresponds to the center of mass (CM) frame of a two-particle problem with potential V(r\ — 7 * 2 ) since such a 
scenario can be reduced to an effective one-body problem. We now want to analyze the difference between the lab 
and CM frames for two particles scattering off each other, defined in the figure below. 


(a) 



(b) 



0 
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Consider the lab-frame where particle 1 with mass mi and velocity vq scatters on mass m 2 with zero velocity. The 
CM velocity in the lab-frame is then: 


m ivp 

m 1 -\-m 2 ' 


(6.99) 


We seek the relation between the scattering angles 0 and Ol in the CM and lab frame, respectively. The above 
figure shows that: 


tan Ol 


v' sin 0 
v' cos 0 + V' 


( 6 . 100 ) 


Note that v' — \v'\ = vq — V (magnitude of velocity of the scattered and incident particle in the CM frame is the 
same). Inserting v' and V into the above expression for tan Ol yields: 


tan 0 L = --7-• (6.101) 

cos 0 + mi/ m2 

Thus, when the target mass m 2 00, we obtain Ol — 0, as expected. Moreover, for equal masses mi = m 2 , one 
obtains tan Ol — tan 0/2, so that Ol — 0 /2 is maximally 7t/2. 


To identify a relation between the scattering cross sections in the two frames, we will need: 

„ 1 cos 0 + 7 

COS 0 L — 7 — — 7 

\/l + tan 2 Ol y 1 + 27 cos 0 + y 2 

where we defined 7 = mi/m 2 . Moreover, the relation between the solid angles is: 

cIQl sin OlOOl d cosOl l + ycos 0 

dQ sin OdO dcosO (1 + 27 cos 0 + y 2 ) 3 / 2 


( 6 . 102 ) 


(6.103) 


Now, the particle flux incident toward the target should only depend on the relative velocity between 1 and 2 in 
both systems and is thus the same. Also, the same number of particles have to be scattered into dQ and dQ,L'- 
the physics cannot be different by changing referenc frame. Because of the above two facts, it follows from the 
definition of the differential scattering cross section that 

da L {0 L , 0) = dcr( 0 , 0). (6.104) 


Hence, we find that 


d(JL da (1 + 2y COS 0 + y 2 )^/ 2 

dfl l dQ I + 7 cos 0 


(6.105) 


I. Scattering of identical particles 

We now consider what happens when two identical particles are scattered on each other. It is known that the two- 
particle state satisfies - 0 ( 1 , 2 ) = - 0 ( 2 , 1 ) for bosons and - 0 ( 1 , 2 ) = —' 0 ( 2 , 1 ) for fermions where 1 = ( 77 , si) and 
2 = ( 7 * 2 , 52 ). Consider only the spatial part of the wavefunction to begin with and focus on the CM frame. In this 
case, a two-particle state 0 that is symmetric (antisymmetric) in 77 and 7*2 must be an even (odd) function of the 
relative-coordinate r = 77 — 7 * 2 . In spherical coordinates, r —r means that (r, 0, 0) —>> (r, 7r — 0, 0 + 7r). Now, 
our asymptotic wavefunction is neither symmetric nor antisymmetric in the form that we have used it. Therefore, 
for identical particles it must be replaced with 

V>(1,2) = e lk r ± e~ ik r + [f{6) ± /(tt - 0)]e ikr /r. (6.106) 

The upper sign is used for a symmetric wavefunction, and the lower for an antisymmetric wavefunction. Note how 
the spherical part e lkr /r accounts for scattering of particlecs in diamatrically opposite directions. As before, da is 
defined by the ratio of the particle flux into dO and the incident particle stream for one of two plane waves: 

^ = I/(0)±/(tt-0)| 2 - (6.107) 

It makes sense physically that the scattering of both particles must be taken into account when they are identical, 
because we cannot distinguish between the following scenarios shown in the figure. 


Download free eBooks at bookboon.com 


63 













INTERMEDIATE QUANTUM MECHANICS 


QUANTUM MECHANICAL SCATTERING THEORY 


(a) 



(b) 



Eq. (6.107) is consistent with the standard QM treatment: add "wavefunctions", then take absolute value squared 
to compute probabilities. Note that this is different from how we would classically allow for the two possibilities: 
dncr = |/(0)| 2 + |/(tt — 0)| 2 , which has no interference term between /(0) and f(i r — 0). Whether or not to use 
symmetric or antisymmetric states xjj depends on the spin configuration. We now proceed to illustrate this. 


Scattering of spin-0 particles. 

Spin-0 particles: bosons —)> spatially symmetric wavefunction. We thus use the upper sign in d^a. Assume for 
concreteness that the bosons interact via the Coulomb-potential, for which case we have 

fc(0) = — , L /0 , e- 2ilnsin( * /2)+i *- (6.108) 

2/csm (0/2) 


Here, n = Z 2 m/(kaom e ). Inserting this into d^a: 


da 

dQ 


Z 2 e 2 
4tT6q Et 



(0/2) + cos 4 (0/2) + 


2 cos[n In tan 2 (0/2)]' 
sin 2 (0/2) cos 2 (0/2) - 


(6.109) 


where Et = h 2 k 2 /2m. The last term is a purely QM effect stemming from the interference between /(0) and 
f (jt — 0). This effect due to identical particles in QM has been verified experimentally for C 12 scattering on 
carbon [see Phys. Rev. Lett. 4, 365 (I960)]. 


Scattering of particles with spin. 

Even if the interaction between two spinful particles does not depend on the spin itself, we must consider the fact 
that the particles have spin to obtain the correct d^a. To see this, consider e — e scattering (spin 1/2). Now, two 
spin 1/2 states may be combined into one singlet (tl — It) or three triplet (tt> II, tl + It) states. If the particles 
are randomly polarized: probability 1/4 for singlet and probability 3/4 for triplet state. This yields: 

dno- = f(0) - /(7T -6 )| 2 + 1| f{6) + /(7T - 6)\ 2 . (6.110) 

From this, we can infer that particles scattered into 0 = n/2 must be singlets, since the triplet contribution is zero 
for this angle. Moreover, if the spins are not initially random, but fully polarized in the same direction (i.e. triplets), 
there can be no scattering into 0 = 7r/2. 
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VII. MAGNETIC FIELDS IN QUANTUM MECHANICS 

Learning goals. After reading this chapter, the student should: 

• Be able to write down how the presence of a magnetic field, and thus vector potential, can modify the 
Hamiltonian of a system, and what the physical meaning of each corresponding term is. 

• Be able to explain what Landau levels are, how they behave physically, and what type of physical conse¬ 
quences they lead to. 

• Be able to explain what the Aharanov-Bohm effect is and to mathematically outline the basic equations 
describing it. 


We shall consider here the influence of magnetic fields on QM systems. This is extremely important because it is 
one of the simplest and most common experimental ways to manipulate eigenfunctions and energy levels. 


A. Zeeman effect 


Normal Zeeman effect. 

To incorporate a FLfield into the SE, we know from earlier treatment ( e.g . classical mechanics) that we should 
include a gauge field A. For B = Bz , we may use e.g. A = ^ (—y, x , 0). The difference between Hamiltonians 
with and without a magnetic field becomes 


2m 2m m 2m 


qB „ „ 2 

= ~2^ {xp y- yPx) + ^ 


{x 2 +y 2 ). 


Since xp y — yp x = L z is the angular momentum component in the z-direction, we may write 


H' = H — Hq = —-^—BL Z + 
2m 


1 or , y 2 B 2 ^2 , 2 


8m 


(x 2 + y 2 ) = H[ + H' 2 . 


(7.1) 


(7.2) 


Focus now on the term H[ = —fi L B since H 2 is quadratic and negligible for small B. We treat H[ as a 
perturbation and we defined \i L = qL/2m. H[ describes the coupling between external field and the induced 
field of a charged particle with orbital angular momentum. 


For a spherically symmetric potential, R n i (ft) are eigenfunctions for L z with eigenvalue hm. Considering 

an electron (q = —e and m = m e ), the added energy due to H[ becomes 

A E = 6 ^-m = ysBrn (m = —l, ...,/). (7.3) 

2 m e 

We defined the Bohr-magneton ps — eh/2m e . Every energy level is thus split into 21 + 1 levels with a spacing 
depending on B and not on the quantum numbers n or /. This is the normal Zeeman effect, but when we take into 
account spin we obtain the experimentally observed anomalous Zeeman effect. 


Anomalous Zeeman effect. 

A particle with spin has an additional internal angular momentum /i s which also couples to the magnetic field. 
The total perturbation then becomes H[ = (L z + 2S Z ) where we used g s = 2 as the Lande ^-factor. However, 
we must also consider how spin influences Ho , i.e. the B -independent part. This part gains a spin-orbit interaction 

H so = f(r)L • S (7.4) 

so that the eigenfunctions now depend on J 2 and J z where J = L + S, as L and S are no longer conserved 
separately (the Hamiltonian does not commute with either in the presence of H so ). We proceed to distinguish 
beween weak and strong magnetic fields. 
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Strong fields: In this case, we may disregard H so relative the magnetic term H[. The resulting splitting is then 
simply 


cH 

A E — - (rn + 2 m s )B = /i B B(m + 2 m s ). 

2 m e 


(7.5) 


A given energy level is then split into 2/ + 3 levels for / > 0 since m + 2 m s takes values between — l — 1 and 
/ + 1. For l = 0, the splitting is into two levels. 


Weak fields: This is a more complicated situation since we cannot disregard H so anymore. Thus, the eigenstates 
of the unperturbed Hamiltonian are | j,rrij,l). This is because H 0 + H so commutes with both J 2 ,J Z1 and L 2 . 
Thus, the perturbation energy becomes: 


A E = n B B{j,mj,l\L z + 2S z \j,mj, |Z) = mu B B(hmj + (j, mj, l\S z \j, mj, l)). (7.6) 


Here, j and rrij are the quantum numbers determining the eigenvalues for the operators J 2 and J z . For s = 1/2, 
there are two possible values for j: j = l ± 1/2. To compute the expectation value of S z , we want to express 
| j.rrijj) in terms of eigenspinors for S z . This is a length but straightforward calculation which we do not show 
here (see introductory QM course and angular momentum operator algebra), but simply state the final result: 

frrrj ■ 

(j,mj,l\S z \j,mj,l) = (7-7) 

Inserted into AE,we obtain: 

AE = (j = l± i/2, rrij = -j,... ,j). (7.8) 

This gives rise to a different energy splitting with a spacing that is no longer independent on the quantum numbers. 
We show the magnetic field splitting for the hydrogen n = 1 and n = 2 levels in the figure below. 


n = 2 



rrij = 3/2 
1/2 
- 1/2 
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- 1/2 


▲ 


▼ 


A 


We have introduced the notation 25+1 Lj to characterize the levels, where S is the quantum number for total spin 
(1/2 in our case), L is the quantum number for total orbital angular momentum (S : l «= 0, P : l = 1 , D : / =» 
2,. ..), while J is the quantum number for total angular momentum J. Moreover, A = 2 fi B B. For instance,^ 2 Pi/ 2 
then means s = l/2,Z = l,j = 1 /2. 


B. Landau levels 


The Zeeman effect is concerned with the effect of B on bound electrons, such as the coupling between spin S and 
field B. We consider free electrons, neglecting spin for now, and show that for a constant B, the SE can be solved 
exactly. We use a Landau-gauge A = (—By , 0,0) so that the Hamiltonian for q = — e becomes: 


h 2 „ 0 i ehB _ e 2 B 2 2 
-yo x + —y . 


2 m 


2m 


(7.9) 
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This H commutes with p x and p z and thus admits common eigenstates with these operators. The general solution 
should then have the form ip(r) = e lkxX + lkzZ <fi(y). Inserted into Hip = Eip, we obtain the following equation for 


r h 2 k 2 heBk T 


2m 


2 d2 


. 2m 


m 


e z B 

2m 


2u2 


> + 


2m 


This can be written in a more compact manner: 


~^m^" + _ yoU = E<j>. 


■ = E4 >. 


(7.10) 


(7.11) 


where E = E + h 2 k 2 /2m, yo = Hk x /eB , and uj c = eB jm. We see that uj c is the cyclotron frequencey: classical 
angular frequency for the circular motion of an electron in a B field. Now, Eq. (7.11) has a familiar form: a 
harmonic oscillator centered around yo . We immediately know what the eigenvalues are according to our detailed 
previous treatment of such a system: 

E = (n + 1/2 )htv c E=(n + 1/2 )hu c + ti 2 k 2 z /2m (n = 0,1,2 ,...). (7.12) 


The belonging eigenfunctions are: 


V’( r ) = e ika,x+ikzZ 4> n (y - yo) (7.13) 

where (p n is the n-th harmonic oscillator function. The energy E for our particle thus has two parts: free particle 
motion along the B field (z-axis) and quantized motion perpendicularly to B (xy- plane). 

Landau-levels: quantized levels for fixed k z (varying n). 

Landau-bands: continuous energy bands for fixed n (varying k z ). 
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An important aspect is that since the energy does not depend on the quantity yo oc k x , the Landau levels are 
massively degenerate. To see this, consider a large but finite volume V = L x L y L z . Using periodic boundary 
conditions, e.g. + L x ) — ^(x), the allowed values of the momenta are k x — 2pn x /L x and k z = 2pn z /L z 
where rii are integers. Note that using periodic boundary conditions allows us to use free-particle wavefunctions 
to count the number of states in contrast to hard-wall boundary conditions where ^ = 0 at the edges, while still 
obtaining the same density of states. In turn, this means that the allowed values for y 0 = hk x /eB are separated by 
A^/o = h/eBL x . The number of available positions for yo then becomes: 


E y _ j j eB _ <h to t 

* v T~hJ~e 


(7.14) 


Here, <f> tot = L x L y B is the total magnetic flux through the area L x x L y . We may conclude that each Landau 
level contains the same number of states: 4> to t/ (h/e). Each state then carries a flux quantum <I>o = h/e. 


Oscillation of the Fermi level. 

We saw above that the degree of degeneracy of Landau levels was L x L y Be/h per level. Taking spin into account, 
the degeneracy is doubled. 


Thus, if the 2D electron density of the system is meaning there are in total ri 2 L x L y electrons, they can all 
reside in the same Landau level if the field is so strong that 

B>B 0 = \n 2 -. (7.15) 

2 e 

Consider in fact a 2D electron gas, which is typically studied experimentally in the context of Landau levels. We 
thus disregard excitations in the ^-direction. Let us compute the Fermi energy Ep as a function of B. Note 
that: both the degeneracy of Landau levels (LL) and the Landau level energy itself [E n — (n + 1/2 )heB/m\ are 
proportional to B. 

• For B > Bo , all electrons are in the lowest LL so that Ep — \ehB/m. 

• If Boj2 < B < Bq, the electrons that can’t fit into the lowest LL have room to be in the second lowest LL: 
Ep — 3ehB/2m. 

This argument is repeated as B decreases. At B — Bo, Ep jumps from \ehBojm to | ehBo/m. At B — \Bo, 
Ep jumps from | ehBo/m to | ehBo/m, et.c. 

In general: discontinuities at B = B 0 /k where Ep jumps between (1 — ehBo/m and (1 + ehBo/m. 

E f 


ehBp 



-1-1-► B 

\Bo Bo 

For a 3D system, this picture is slightly modified. When B < Bq, the electrons that don’t fit into the Oth LL will 
not directly go into the 1st LL, but instead populate states k z ^ 0 with energy 

1 f) 2 k 2 

E=^hw c +—^ (7.16) 

2 2 m 

This energy will increase until it equals the energy of the 1st LL, and then this level is starting to fill up. We 
still have a sharp peak in Ep vs. B every time a new LL is activated. Since the transport properties of metals 
are determined by the electrons at the Fermi level, the strong variation of Ep vs. B is manifested e.g. in the 
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conductivity p which oscillates with B. This is the Shubnikov-de Haas effect. 


Total energy as a function of magnetic field. 

When B is lowered just below Bq , one electron jumps up to E\ and we now know that Ep should make a jump. 
But what happens to the total system energy? Naively, one might first think that since energy is supplied by the 
external field, we should simply see a decrease in E tot when B is reduced. However, it turns out that the physics is 
a bit more interesting than that. 


Consider a system of size L 2 = L x L y and we have N electrons in total. We have seen that the degeneracy of a 
LL is 2L 2 Boe/h when spin is taken into account. Thus, all e in our system fit into Eq when 2 L 2 B 0 e/h — N 
meaning that Bq = Nh/2L 2 e. Thus, we have the situation shown in the figure when we start out with Bq and then 
lower the field so that one e~ jumps up to E\. 


BEFORE 


AFTER 


(B = Bq) 


(B = B f 0 ) 


E i 


E\ 


Eq 


E' n 


The question is now: what is the change in the total energy of the system, in effect A E — E tot — E t ' ot ? We know 
that Eq(B) — heB/2m and Ei(B) — 3heB/2m. Moreover, the field Bq that forces one electron to leave the 
lowest LL must by definition satisfy 

= N _ 1 _► B' 0 = B 0 - < Bo , (7.17) 

Now, we may evaluate A E\ 

ivh 2 

A E = NEq - [(TV - 1 )E' 0 + E[] = ^^(2 - N). (7.18) 

The energy thus increases if N > 2. If N = 2, there is no change since B' 0 = Bq/2. For large N , the total energy 
will in general oscillate as shown in the figure. 



So energy decreases, but non-monotonically. Note that this picture changes if we account for the Zeeman-splitting 
of the electrons, since it removes the factor 2 in the spin degeneracy of the states. 
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C. Aharonov-Bohm effect 
Wavefunction in space with B = 0. 

Assume that B(r) ^ 0 is present in some region of space, whereas other regions have B = 0. An example is a 
very long coil with a current running through it, which to a very good approximation only has 5^0 inside it. In 
the regions where B = 0, we have A = VA since V x A = 0 there. This means that (up to a constant): 

nr 

A(r) = / A(s) • ds (7.19) 

J r 0 


where is an arbitrary point in the region where B = 0. The integration path is arbitrary, as long as we stay 
inside the B = 0 region. The wavefunction is obtained from the SE: 

ihd t ip = A (^ y _ qa )% + V(rU (7.20) 

2 m V 1 J 

where V ( r ) is potential energy stemming from other effects than the field. The physics must be gauge-invariant. 
Thus, let us perform a gauge-transformation: 

A' = A + Vx, 4>' = <i> - dtx, $ = ^e iqx,h . (7.21) 

Now choose x = — A so that: 

i = 7^ ( j V) 1 V + V(r)ij/. (7.22) 
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This is the same equation as if A = 0 in the first place. However, we cannot in general just set A — 0 in a region 
where B — 0 if the field-free area encloses a region where B / 0. To see this, consider the geometry shown in 
the figure. 



If we integrate along the dashed line, we obtain 

A(s) • ds — 


V x A dS = 


where S is the shaded region and <&s is the flux through it. This equation 
in the white region where B — 0. 


(7.23) 

shows that A cannot be zero everywhere 


Interference experiment. 

Does this mean that an electron moving only in an area with B — 0 can still be affected by the presence of B 0 
in the inaccessible region? Aharanov and Bohm suggested the following interference experiment to clarify this. 
The figure shows an electron source that emits e~ from the point Sq, and the electrons consequently pass through 
the slits 1 and 2 and hit the screen at r\ having taken the paths V\ and V 2 , respectively. 



Assume that the shaded region is completely inaccessible to the electrons. The total wavefunction is a superposition 
of the contribution from paths V\ and 7 * 2 - 

Aot = ipv, (r,t) +ip r2 (r,t). 

According to our previous treatment, we have: 

f>'P 1 — (r, t)e ly[ A ^' ds . 

f>v 2 = V’o (r, t)e lji ^2 A ( s )‘ ds _ 

(r, t) is as before the wavefunction for $ = 0. Note that the relative phase between and is: 

f A(s) • ds — f A(s) • ds = (j) A(s) • ds = <f>. (7.26) 

J'Pl J 


(7.24) 


(7.25) 
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Here, <f> is the flux through the shaded region. We may then rewrite 

iptoi{r, t) = (V’oe ie4>/R + V’oK 1 ^ A(s) ' ds , {1.21) 

so that the probability density of electrons hitting the screen becomes 

l^totl 2 = |V’oe ie * /fi + </’o| 2 . (7.28) 

This means that the interference pattern changes with <f> even if the electrons never move in the shaded region! 
This is the Aharonov-Bohm effect, measured in 1960 by Chambers. We learn therefore that A plays a fundamental 
role. However, note that the physically measurable quantity IV’totl 2 is gauge-independent (only depends on <f> and 
not A). 


D. Flux quantization in superconductors 

An interesting case where electrons move in field-free space is in a superconductor, which besides having zero 
electrical resistance also expels B from its interior. For a superconducting cylinder, a flux <f> can pass through the 
hollow middle, meaning again that A cannot be zero in the superconductor despite B = 0 there. 


Superconducting 

cylinder 




The wavefunction ^ inside the superconductor can again be expressed as the <f> = 0 wavefunction times a phase 
factor: 


V’(r) = ^>o(r)e ^ A ( s )' ds m (7.29) 

If the integral path now is taken to form a closed loop inside the superconductor (as shown in the figure), the 
wavefunction is multiplied with Since the wavefunction has to be single-valued, it follows that 

e —i q*/ h = e 2imr , n = 0, ±1, ±2,... (7.30) 


so that 



The flux has to be quantized. This has been experimentally observed and one found the flux to be quantized in units 
of Tih/e. This corresponds to q = — 2e, suggesting that in superconductors the fundamental entity is an electron 
pair (in accordance with so-called BCS theory). 
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VIII. QUANTIZED RADIATION THEORY 

Learning goals. After reading this chapter, the student should: 

• Be able to qualitatively describe how to quantize the electromagnetic field quantum mechanically. 

• Be able to explain the physical significance of coherent states for the radiation field modes. 

• Be able to schematically write down the state vector for a fully quantized radiation theory and explain what 
spontaneous and stimulated emission means. 


We have treated the EM field classically so far. However, the EM field is also governed by QM and we now want 
to treat both the atomic system and the field quantum mechanically. 


A. Quantization of the radiation field 

The starting point for determining the QM Hamilton-operator of a system is to know the classical Hamiltonian. 
Hence, that is where our investigation begins. 

Classical Hamiltonian for the field. 

Consider the existence of an EM field without any source terms (charges and currents) in a cubic volume V — L 3 
with periodic boundary conditions. V is introduced for convenience so that we can quantify the number of modes 
in the system. 
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It is A that couples the atomic system with the EM field. In a Coulomb gauge with V • A = 0, we may write A as 
a superposition of plane-waves: 


(8.1) 


where uj k — kc and a k x are amplitude factors. Note that the photon A is transversely polarized since e k x _L k , 
guaranteeing that V • A — 0. The sum A goes over discrete wavevectors k = ^ (n x ,n y ,n z ) and over the two 
polarization vectors e kj i and e k ^ (both _L k). Let a k x = akx£~ luJkt for brevity of notation. Now, the energy of 
the field itself is 


H = 1 f [co E 2 + —B 2 ]dr = ^ [ \{d t A ) 2 + c 2 (V x A) 2 ]dr. 
* JV MO ^ Jv 

Inserting our expression for A gives: 


[ e feA • fifc'A' + (fe X e feA ) ■ (fe X e fe 'A')] 

fcA fc'A' 

x / (a feA e ifer - a kx e~ ikr )(a k ' ye ik ' r - a* k , x>e - ik ’ r )dr. 

Jv 

We introduced k — k/\k\. Using the vector identity 

(A x B)(C xD) = (A • C)(B • D) - (A- D)(B • C) 


and 


1 

U 


f e i(fcl-fc2) r dr 


1 if = k2 
0 if 7^ &2 


(8.2) 


(8.3) 


(8.4) 


(8.5) 


since the allowed ^-values are such that an integer number of wavelengths fit into each side L of the volume, we 
see that there is only a contribution to the sum ^2 kk > from k! — ±k. Moreover, since e k x _L k, we actually only 
get a contribution from k! — k since 


1 + k • k 


2 if k' — k 
Oif fc' = -k. 


(8.6) 


Finally, since e k x • e k x> — Sxx' (orthogonal polarization vectors), we obtain in total 

ki = - ^2 ^kiakxalx + a k\ a k\) ^ khj k a kX a kX 

kX kX 


(8.7) 


where we used that a kX is just a scalar amplitude and thus commutes with a k x . It is now useful to introduce the 
real and canonical variables: 


QkX 



(a k x + a kx)i 


PkX 


1 

i 



0 a kX — a kx)- 


To prove that these are indeed canonical, recall that a k x oce luJkt which provides 


(8.8) 


. khj kf * x du 

Qkx = -iV -^-{a k x - a k x) = PkX = 


ku k 2 du 

PkX = -\l ^-(ttfcA + a kX ) = -u k qkX = 


(8.9) 


which are precisely Hamilton’s equations for canonical variables. To show the last equality in each equation, note 
that 


&kX — 


\f 7 2KJ k 


(u k qkx + i Pkx): &kx = 


\f 7 MJ k 


(u k q k x ~ iPkx)- 


( 8 . 10 ) 
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This is a key result: the classical radiation field is formally equivalent to a set of independent harmonic oscillators 
with mass 1 in our variables. 


Quantization. 

Having established the equivalence to harmonic oscillators, the procedure to go quantum mechanical is clear: we 
replace the classical oscillators with QM oscillators. The energy of the system then becomes 


E = X)( nfeA + (s.n) 

k\ 

where rtkx = 0,1, 2,... is the number of photons in the mode (k, A). The corresponding state of the system is the 
product: 

\n kl ,x^ n k2,\2,---) = l«fei,Ai) • \n k2 ,\ 2 ) ■ ■■■ = f[l nfc > A )- (8-12) 

k, A 

Moreover, we introduce (as for the standard QM harmonic oscillator) creation and annihilation operators (a* - , a) 
via: 


QkX = + 4 a)> PkX = T Y^^(«fcA - 4 a)' 


(8.13) 


From the commutator [pkx , qkx\ = fr/U it follows that [a^ a, = 1. The operators for two different modes 
commute, as they are independent of each other. The operators have the known properties (omitting indices for 
brevity): 


a^\n) = y/n + le lujt \n + 1), a\n) = y/ne lujt \n — 1), (8.14) 

so that a) kX creates a photon in the mode ( k , A) while a^x removes one such photon. The number operator N^x = 
a fcA a fcA counts the number of photons in mode (k, A): 

N k x\ ■ ■ ., n k x , • • •) = rikx \ • • -,n k \,...). (8.15) 

The QM operator for the vector potential can now be expressed via creation and annihilation operators: 

<8j6> 

kX 

Calculating the Hamilton-operator in the same way as the classical procedure then yields: 

E -=\^f J {ak\a{ x +al x a k x). (8.17) 

Z kX 

Note that we cannot any longer freely exchange the order of a and a), since they are operators that do not commute. 
Instead, we get: 



Here, E 0 is the ground-state energy (sometimes referred to as the zero-point energy) of the radiation field. Note that 
the generator for the electric field S is obtained via S = —d t A. The constant E 0 can usually simply be removed 
since we can choose the reference level for energy where we like. However, there are interesting exceptions such as 
the Casimir effect. The essence of this phenomenon is that altering the geometry of a system (such as two metallic 
plates) changes the allowed frequency spectrum {oj^a} and thus changes E 0 . If E 0 is reduced, it causes the system 
to try to alter its geometry, leading e.g. to an attraction of the metallic plates. 


B. Coherent states 

If we compute the expectation value of £ in a state \n k x) f° r a mode of the radiation field, we obtain 

(n fc A|£|n fcA } = 0, (8.18) 
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since (n\a\n) = (n\a^\n) = 0. That does not seem very encouraging in terms of correctly describing an EM 
wave. The question becomes: what kind of QM state for the radiation field gives a description which seems more 
reconcilable with the classical picture? 

Our previous treatment of a harmonic oscillator again provides the solution: coherent states \a) = c n\ n ) 

corresponded to a classical oscillation. Due to our established analogy between the QM treatment of the A field 
and a harmonic oscillator, we can in the same way construct coherent photon states for each mode (fe, A). Using 
our previously derived result 


(a\a\a) = ae luJt , (a\a^\a) = a*e lu;t , (8.19) 

it follows that the expectation value of the electric field operator £ in a coherent state for the mode (fe, A) becomes: 

(a\£\a) = ie fcA y^[ae ifc — i ^ t - (8.20) 

The correspondence to a classical harmonic wave becomes more clear if we write a = |a|e 10 : 


(a\£\a) = M sin ( fc • r - u k t + 0). (8.21) 

The importance of coherent states lies not only in the fact that they provide a clear, formal similarity between 
the expectation value of £ in the QM treatment and a classical wave, but also because a monochromatic (fixed 
wavelength) laser can generate such coherent excitations. Thus, these states have direct experimental relevance. 
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An interesting observation is that the standard deviation: 


AS ~ d (£')-(£)* 


( 8 . 22 ) 


from the expectation value is independent of the field amplitude a. One finds AS = ^which thus becomes 
less and less relevant as (£) increases as shown in the figure. 



We can also compute the average number of photons in a coherent state \a) for the mode (fe, A): 

(n) = (a\N kX \a) = |a| 2 . (8.23) 

The entire distribution of photons [the probability P(n) to find the mode excited with n photons] is found in the 
usual way: projecting the total state (a) on the state | n) with n photons, 


P(n) = |(n|a)| 2 — e 


,|2n 


n\ 


l ' 


(8.24) 


The photon-number in a monochromatic coherent state is thus Poisson-distributed: 


P(n) = e -< n >-^ 


(8.25) 


C. Fully quantized radiation theory 

We are now in a position to treat both subsystems which are part of radiation theory (atoms and photons) quantum 
mechanically. We do so using perturbation theory. The unperturbed Hamiltonian Hq contains no interaction 
between the two subsystems. We may then write the total state as the product of independent states for each part. 
It is natural to use the energy states as basis vectors: 

(atomic + photon system) = (atomic system) • (radiation field) = (0) • l^fci,Ai, ^fc 2 ,A 2 5 • • •)• (8.26) 

The two subsystems are coupled via interaction terms such as 

H[ = -—A ■ p and Hn = --S ■ (V x A), (8.27) 

m m 

causing transitions between the unperturbed states. The transition rates can be computed via time-dependent 
perturbation theory. We now consider some examples. 

Spontaneous and stimulated emission. 

Spontaneous (stimulated) emission is the emission of light from an excited atom in the absence (presence) of 
photons in the initial state. Let (fc, A) be wavevector and polarization for the emitted photon. The excited state 
| , 02(f)) has energy E 2 and the final state |0i (£)) has energy E\. Thus, we may write: 

I*) = 1^2) n k \,...), I/) = IV’i) • I • • ■ ,n kx + 1,...}. (8.28) 

Assume that the dominant perturbation term is: 

H[ = — A ■ p = — ^ \l w/ 1 [ak\e lk r + 4 A e“ lfc r ]e fcA ■ P (8.29) 

m m f-' V 2 Ve 0 uik 

k A 
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Matrix elements involving the photon states are simple since only the term with a kX contributes: 

{... ,n kx + 1,.. ,\H[\ ... ,n k \,...) = — ^ -e~ lfcr+lajfc V»fcA + l(efcA -p)- (8.30) 

TTi V 2 V 

This means that (now including the time dependence from | ipi)): 

(.f\H[\i ) = + 1 ) e fcA • (8.31) 

where TVf = ('0i|e _lfc r p|'02)- Using our results derived previously in chapter 4 for the transition rate between 
two states, we obtain: 

9 77- ty 

W 2 _n = —^7 -(n fcA + l)|c fcA • M\ 2 S(E 2 -E 1 - fiu k ). (8.32) 

n m z 2V6 q uok 

The ^-function ensures energy conservation. We can also obtain the total transition rate from atomic state 2 to 
atomic state 1, regardless of the mode of the emitted photon, by performing a summation over all possible modes. 
We use that there are V/{2ix) 3 d 3 k modes with wavevector k in the element d 3 k for a given polarization, which 
yields: 

= 5 £/ Hi;*"" + mEl ~ E ' ~ ' M i 2 - (8J3) 

Using that d 3 k = dk — 2 tt sm0d0k 2 dk and 5(E 2 — E\ — huj^) = £(^21 — hkc) where hw 2 i = E 2 — Ei, we 

obtain 

- E 47 r e 0 m 2 4c3 l sin ^l e fcA-M| 2 (n fcA + l), (8.34) 

where k = uj 2 i/c. The angle 9 specifies the direction of k relative M as shown in the figure. 


e&2 



Spontaneous emission. 

Let us for now focus on the case of spontaneous emission n^x = 0. The summation over polarization directions is 
easy if we assume that e^i lies in the plane spanned by k and M, so that e^ 2 is JL this plane. Since e^ 2 _L M , we 
obtain 


y: \e k \ ■ M I 2 = \e kl ■ M | 2 = sin 2 0\M\ 2 . (8.35) 

A 

Performing the resulting angular integration yields as our final result: 

(836) 

The name "spontaneous emission" was given during a time when one believed that the process was truly not 
caused by any interaction. We now see that this point of view is incorrect: the emission occurs as a result of 
stimulation by the EM field in vacuum. 

Stimulated emission. 

By setting n^x — 0 in the factor (rife a + 1), we omitted the possibility of stimulated emission: the presence of 
photons. A key difference from spontaneous emission is that stimulated photons will have the same direction and 
polarization as the stimulation, rather than being arbitrary. This is a crucial principle behind how a laser works. 
The opposite process, stimulated absorption, has a transition rate proportional to x since: 

\{nkx ~ 1 |<^fcAl^fcA) | 2 = nux- (8.37) 
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IX. DENSITY MATRIX AND QUANTUM STATISTICS 

The lecture notes forming the basis for this chapter follow roughly the same structure as the beginning of the 
corresponding chapters in "Quantum Mechanics" by Bransden & Joachain. 

Learning goals. After reading this chapter, the student should: 

• Be able to describe the difference between mixed states and pure states (in particular a superposition of 
states). 

• Know how to define the density matrix, density operator, and how these quantities can be used to tell apart 
mixed states from pure states and compute expectation values of operators. 

• Be able to qualitatively sketch how information about the polarization of a spin-1/2 system can be obtained 
using the density matrix. 


So far, we have considered quantum systems described by a single wavefunction (or state vector). Such systems 
are said to be in a pure state. These have been assumed prepared in a specific way so that the state vector is 
completely known. We will now study quantum systems that have states which are incompletely known: mixed 
states. Instead of a single wavefunction, one must use a statistical mixture of wavefunctions to describe such 
systems. 

Thus: quantum statistics dealing with such quantum systems as mixed states is the quantum analogue of classical 
statistical mechanics. A crucial fact that must be strongly emphasized is: 

A mixed state is not the same as a superposition of states. 

Let us illustrate this with a concrete example. Two identical boxes A and B contain a large number of spin-1/2 
particles. 



100% of the particles are 50% of the particles are in state |+) z 

in the state -j=(\+) z + |— ) z ) 50% of the particles are in state \—) z 

Which statement is then true? 

1. The boxes are the same: the difference is just semantics. 

2. The boxes are technically different, but experimentally indistinguishable. 

3. The boxes are experimentally different. 

Take a minute to think about this. The correct answer is 3, since A is in a pure state (superposition of states) while 
B is in a mixed state. We can prove this as follows. 


Consider a so-called Stern-Gerlach (S-G) device which effectively measures spin in a given direction. If we use a 
S-G device oriented in the ^-direction, A and B give identical results. However, if the S-G device is oriented in the 
x-direction (so that it measures spin polarized along the x-axis), all particles in box A are measured to be spin-up 
whereas approximately half the particles in box B are measured to be spin-up. The other half in box B is measured 
to be spin-down. In effect, boxes A and B give experimentally different results. This can be understood by noting 
that |+)^ + |—) z is the +ft/2 eigenstate of S x while \+) z and \—) z individually may be written as 50-50 linear 
combinations of the ± ft/2 eigenstates of S x , in effect: 

|_^ (I +)z + I —)z) + (I +)z - I )z) ^ ^ 

In order to be able to distinguish clearly mathematically between pure states (which can be superpositions) and 
mixed states, we will begin by introducing the density matrix formalism. As a concrete application, we shall 
analyze spin-1/2 particles. 


Download free eBooks at bookboon.com 


81 




















INTERMEDIATE QUANTUM MECHANICS 


DENSITY MATRIX AND QUANTUM STATISTICS 


A. The density matrix 

Consider a system consisting of an ensemble (collection) of N sub-systems a = 1, 2,... N. Suppose that each 
sub-system is described by a pure state Using Dirac notation, we denote this pure state by |a). All state 
vectors are assumed normalized to unity, but need not be orthogonal to each other. 

Next, select a complete set of basis vectors | n), i.e. orthonormal eigenvectors of some complete set of operators. 
We then know that ( n'\n) = 8 n ' n and \n)(n\ = 1. We expand the pure state | a) in these basis states \n): 

l Q > = J2 C n ] \ H ) ^ C n“ } = («!«)• (9-2) 


Moreover, since ( a\a) = 1 —> l c n^ | 2 = 1- Consider an observable represented by an operator A. The 

expectation value in state |a): 


(A) a = (a\A\oi) = ^(n\a)(a\ri)(n'\A\n). (9.3) 

nn' 

Now, the average value of A in the ensemble of pure states is called the ensemble (or statistical) average of A and 
is given by: 


N 

(- A) = Y J W a (A) a (9.4) 

a=l 

where W a is the statistical weight of each pure state | a), i.e. the probability of finding the system in this state 
(0 < W a < 1). Clearly, W a = 1. We then have: 

N 

(A) = ' S ^{n\a)W a (a\n')(n'\A\n). (9.5) 

a =1 nn' 

Let us now introduce the density operator p : 


N 

P = L] \ot)W a (a\. (9.6) 

a=l 

Taking matrix elements of the density operator between basis states |n), we obtain the density matrix p in the {n} 
representation whose elements are: 

Pnn' = (n\p\n’) 

= yy (n\a)W a (a\n') = ^ (9.7) 

CK=1 CC = 1 

We emphasize that we are denoting the density operator by p while the density matrix is p. Note that the density 
operator is independent of the choice of the representation, but the density matrix has a different form in different 
representations. We can thus express (A) as follows: 

N 

(A) = Y J Y, W «\- c n'Yc { n\ri'\A\n) 

a=l nn' 

= '^ J { n \p\ n '){ n '\A\n) 

nn' 

= y>iMi»> 

n 

= Tr (pA) (9.8) 

according to our definition of the matrix elements Eq. (9.7). We have then found that: 

Knowing the density matrix enables us to obtain the ensemble average of a quantity A. 
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We see that a normalization condition Tr(p) = 1 is obtained by setting A = 1 (identity operator). If we had pure 
states | a) that were not normalized to unity, then the calculation would have given: 


(A) 


Tr (pA) 
Tr (p) ' 


(9.9) 


The density matrix is Hermitian, as seen from its definition: (n\p\n') = (n'\p\n)*. 


A consequence of this is that we can always diagonalize p by means of a unitary transformation. Its diagonal 
elements p nn = J2a=i W a \c^ | 2 have a simple physical interpretation. They are the probability of finding a 
member of the ensemble in the pure state | n). We also see from the equations that p nn > 0: p is a so-called 
positive semi-definite operator. 

Since Tr(p) = 1 and p nn > 0, it follows that 0 < p nn < 1. Moreover, Tr(p 2 ) < Tr(p) = 1 because of this. 
This relation holds regardless of which representation we write the density matrix in, since Tr is invariant under a 
unitary transformation due to its cyclic property: 

Tr (UpU*) = Tr (UU* p) = Tr (p). (9.10) 

Consider the special case such that the system is in a particular pure states |A). Then, W a = 5 a \ and from our 
definition p = \a)W a (a\, we have p x = p = |A)(A|. This is called a projection operator onto the state |A) 

which satisfies (p x ) 2 = p x —>> Tr[(p A ) 2 ] = Tr(p A ) = 1. 

The equation Tr[(p A ) 2 ] = 1 in fact gives us a criterion for deciding whether a state is pure or not , and this criterion 
is invariant under all unitary transformations since Tr is invariant under these. It also follows that 

Tr (p x A) = ^^(n\p x \n')(n'\A\n) = (A|A|A). (9.11) 

nn' 
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Suppose that we use a representation {\k)} in which p x is diagonal. Then, the above equation is satisfied if: 

Pkk = Sk A- (9.12) 

Then, p x only has one non-vanishing matrix element which is equal to 1 in the A-th row and column. In turn, 
this means that all eigenvalues of the pure state density matrix p x are equal to zero in any representation except 
one eigenvalue which is equal to unity (since eigenvalues don’t change under unitary transformations). This is an 
equivalent way of characterizing a pure state via the density matrix. 

Upon labelling the rows and columns of p with indices n and n f , both n and n' generally refer to a set of indices 
(such as quantum numbers). Often times, however, we are only interested in a particular property of the system, 
such as its spin. We then omit the dependence of p on all other variables, keeping only the relevant spin variables 
and in this manner define a reduced density matrix. 


B. Spin 1/2 system density matrices and polarization 

We shall now apply the general methods presented above on the case of spin-1/2 particles, e.g. a beam of electrons. 
The pure states of a spin-1/2 particle are labelled by momentum eigenvalues ( p X iP y ,Pz ) and the spin projection 
eigenvalues m s h with m s — ±1/2. Let z be the quantization axis. The states are then \p x ,Py,Pz, m s) and the 
density matrix elements are: 


(n\p\n') = (p x ,p v ,p z ,m s \p\p' x ,p' y ,p' z ,m' s ). (9.13) 

The momentum indices are continuous whereas the spin indices are discrete. We focus here on the spin properties 
- disregard the momentum labels and look at the reduced density matrix (m s \p\m' s ), which then is a 2 x 2 matrix 
in spin space. 


Consider two beams of electrons. One beam has N a electrons in the pure state \x a )- The other beam has Nb 
electrons in the pure state |x 6 ). The density operator describing the joint beam is: 

P = W a \x a )(x a \ + W b \x b )(x b \ (9.14) 


where the statistical weights are 


W a = 


Ng 

Na + NC 


W b 


Nt 

Na+N b ' 


We now choose a basis set of two states |xi) and \\ 2 ), for instance the two basic spinors: 


Ixi) 



0 

1 


(9.15) 


(9.16) 


and expand our pure states in terms of these: 

\x a ) = cflxi) + cSIX 2 >, \x b ) = c5 Ixi) + 4| X2 ). (9.17) 

It follows that the density matrix in the {|x*)} representation is given by 

„ = [ W a \ctf + Wb\c\ I 2 WV?4)* + WV54)*1 ,q,ox 

P [WaWdl + w b (c\yc b 2 w a \c a 2 \ 2 +w b \c b 2 \ 2 \- c J 


If our mixture consisted of N\ electrons in the \x a ) = |xi) state and N 2 electrons in the \x b ) = IX 2 ) state, the 
joint beam would be represented by the density operator 


P = W- i|xi)(Xi| +W 2IX2XX2I 


(9.19) 


where W\ — Ni / (N± ± N 2 ) and W 2 — N 2 /(Ni + N 2 ). Since now — c h 2 — l and c\ — c 2 — 0, the density 
matrix becomes diagonal in the { |x®)} representation: 


P = 


W x 0 

0 w 2 


(9.20) 
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Polarization. 

Let p be a general 2x2 density matrix describing a spin-1/2 system. The unit matrix and three Pauli matrices form 
a complete set of 2 x 2 matrices, so we may write in general 

p = a 0 1 + a x (j x + a y Gy + a z (7y m a 0 / + a cr . (9.21) 

Here, {a 0 , a x , a y , a z } are presently unknown parameters and I is the identity matrix. Since we know that 
Tr(p) = 1 is always satisfied, it follows that a 0 = 1/2 by using Tr/=2 and Trcr* = 0. 

The coefficients a*, i = x,y, z give information about the polarization of the mixture of states described by p. To 
see this, first note that (cr*) = Tr(pcr*) as we have previously derived. Inserting our general expression for p and 
using that Tr(cr^oy) = 2£*j, we find (cr*) = 2a*. We can then write: 

p = l -(I + (T -P) (9.22) 

where P = (a) is the polarization vector. Since p is Hermitian, we may always diagonalize it by choosing an 
appropriate set of basis states. We have 


1 

1 + P z Px — i Py 

1 

'1 + P 0 

P= 2 

P X + i Py 1-77 

Aliag — 2 

0 1-P 


where P = ±|P| = P 2 + P 2 + P 2 . We see that in the representation where p is diagonal, one has P x = 

P y = 0 and P — P z . Thus, if we let | X) an( l I i) correspond to the kets for spin-up and spin-down with P along 
the z-axis, we obtain: 


<T-P\t)=P\ t), <r-P\l) = -P I ;)• (9.24) 

We previously established the physical interpretation of the elements p nn : the probability of finding a member of 
the ensemble in the pure state | n). It follows in our case that (1 + P)/2 is the probability of finding in our mixture 
the pure states with spin-up along P. 


This probability may also be expressed as IV+ / (IV+ + IV_) where N± is the number of spin measurements giving 
the value ±h/2 in the P-direction. It follows that 


*< 1±p > 


N± N+-N. 

N+ +N_ N++N^' 


(9.25) 


In effect, we have proven that the polarization P is quite naturally the probability of finding the system in the state 
| |) minus the probability of finding the system in the state | . 


If P = 0, then p = diag(l/2,1/2) and the system is in a completely unpolarized and random state. In contrast, 
we have previously shown that if p 2 = p, the system is in a pure state. When is this the case? We see that: 

p 2 = [1(7 + <r • P )] 2 = 1(7 + 2tr ■ P + P 2 ). (9.26) 

This is equal to | (/ + cr • P) if P 2 = 1, which means that there are two pure states corresponding to P = +1 and 
P = — 1. The physical interpretation is clear: the system is then totally polarized in the direction of P (P = +1) 
or oppositely to P (P = — 1). The corresponding density matrices for pure states with spin projection ±h/2 along 
z is: 


P = 


1 

0 


0 

0 


for P = +1 and p 


0 0 
0 1 


for P = -1. 


(9.27) 


For intermediate values, 0 < i^i < 1, the system is partially polarized. We conclude this analysis of spin- 
1/2 systems by commenting on the number of parameters required to determine the density matrix. From our 
parametrization p — \(I + cr • P), it is clear that the 2 x 2 density matrix for a spin-1/2 mixed state is entirely 
specified by three real independent parameters P = (P x , P yi P z ). In effect, three independent measurements are 
required to determine p for a spin-1/2 system. On the other hand, in the special case of pure states, our previous 
discussion showed that P 2 = 1 is satisfied. This means that only two real independent parameters are required in 
that case since the third is fixed by the condition P 2 = 1. 
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